DOCUMENT RESUME 



ED 046 628 



RE 003 196 



Gammon, Elizabeth Macken 

A Syntactical Analysis of Some First-Grade Readers. 
Stanford Univ. , Calif. Tnst. for Mathematical 
Studies in Social Science. 

TR-155 
22 Jun 70 
17 Ip . 

EDRS Price ME-$0.65 HC-S6.58 

*Basic Reading, Beginning Reading, Grade 1, 

^Grammar, Linguistics, Phrase Structure, ^Reading 
Materials, ^Structural Analysis, Structural Grammar, 
^Syntax 



Two widely used first-grade reading series, Ginn and 
Scott-Poresman , were analyzed in terms of six phrase-structure 
grammars, in an attempt to discover frequencies for sentence types. 

The six grammars were noun phrase, verb phrase, verbal modifier, 
statements without verbs, interrogative, and statements with verbs. 
Categorial grammars were written for comparison with noun-phrase and 
verb- phras-e— grammars. A quantitative method for evaluating linguistic 
grammars, developed from the analyses, provided a theoretical 
framework for accounting for various kinds of utterances. Chi-square 
analysis showed the grammars to be well-suited to both series, with 
the verbal modifier and statements without verbs providing the best 
fits. The categorial grammars were approximately equivalent to the 
phrase-structure grammars, making them possibly more useful for 
further study since they require fewer manipulations of the text. It 
was concluded that the method developed would be useful in further 
analysis of reading materials. Such analysis would be important in 
increasing the similarity of sentence structure in materials to that 
of children's speech. Tables and a bibliography are included. (MS) 
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CHAPTER I 

RATIONALE AND OVERVIEW 

An important problem at all levels of instruction is the appropriate 
matching of reading materials to the reader . ” Appropriate matching” refers 

to the reader’s ability to comprehend the materials and to the effectiveness 
of the materials in furthering some of the aims of the educational process) 
the term "reading materials 15 includes both materials which teach children 
to read and materials which convey other information. In the past the 
appropriateness of materials has been discussed in relation to their 
content, their format and organization, and their difficulty level as 
expressed by their vocabulary load, their sentence structure, and their 
level of human interest. (Chall, 1958) In the revision of primary readers, 
in particular, the concern has been with finding the appropriate level of 
difficulty as measured by the frequency of new vocabulary words and the 
number of repetitions of a new word. (Becker, 1936; Gates, 1930; Hockett, 

1938; Mehl, 1931; Spache, 194l) But for primary readers two other 
conceptions of "appropriateness” are important: l) How does the sentence 
structure found in the readers compare to the sentence structure of the 
child’s speech? and 2) How does the sentence structure found in the 
readers compare to that of correct adult speech which is one of the 
aims of education? 

Evidence for the importance of the first new conception of appropri- 
ateness, the similarity of the readers to oral speech, canes from four studies: 
Bormuth (1964), Ruddell (1964, 1965 )* and Strickland ( 1962 ),, Strickland 
conducted an extensive analysis of the patterns of sentence structure in the 

*The work reported here has been supported by the National Science 
Foundation (Grant G- 18709 )* Office of Education Contract 0EC-4-6-06l493“2089 
and the Ravenswood Subcontract to Stanford under Office of Education Grant 
OEG 0041, 



oral language of children in the first six grades of elementary school, and 
a partial analysis of the sentence structure of second- and sixth-grade 
readers. The purpose of the reader analysis was to determine whether the 
common speech patterns of children appeared in the readers; no effort was 
made to determine and compare the frequency of the patterns. Strickland 
found that many of the patterns most frequently used by children appeared 
in the readers, but that the patterns appeared in a somewhat random manner 
and that there appeared to be no scheme for development of control over 
sentence structure which paralleled the generally accepted scheme for the 
development of control over vocabulary. Strickland also found direct 
relationships between the development of a child 0 s oral language ability and 
his general reading ability, as measured by a variety of tests. Ruddell 
(1965) studied the relationship of selected language variables to reading 
achievement, and concluded that a child 9 s control over designated aspects 
of his morphological and syntactical language is significantly related 
to his reading ability as measured by paragraph meaning, sentence 
meaning, and vocabulary achievement scores * The Strickland study 
and the Ruddell (1965) study indicate a relation between stage of 
oral language development and reading ability; in the following studies 
by Bormuth (196*0 and Ruddell (196*4-) > the relationships between 
reading achievement and the similarity of reading material to the reader’s 
oral language were exploredo Bormuth used Strickland’s findings to rate 
reading passages according to their similarity to children’s speech. He 
found that this index was related to comprehension difficulty at low levels 
of reading achievement; that is, for children who have reading difficulties 
an increased similarity of the language in the reading material to their 
own language patterns increased comprehension. This relationship was not 
strong enough to be significant, but. in establishing his similarity index 
Bormuth used only the first level of Strickland^r analysis which ignored 
many of the substructures of language, and in addition used her combined 
oral speech data for first, third, and sixth graders while his subjects 
were fourth to eighth graders. If the similarity index had been based on 
more details of the language pattern and had been changed for each grade 
level to include the particular speech patterns of that level, the 
relationship would probably have been stronger, Ruddell ( 196*4-) incorporated 
these ideas. He constructed six reading passages using the language patterns 




which Strickland found to occur in the oral language of fourth graders; 
the patterns were used in the reading passages in the same proportional 
frequency in which they occurred in the oral language. Three of the reading 
passages included only high frequency patterns and three included only low 
frequency patterns. His subjects were fourth graders from the identical 
school from which Strickland obtained her oral language sample. Ruddell 
found that comprehension scores on the written material designed with high 
frequency patterns of oral language structure and comprehension scores on 
written material designed with low frequency patterns of oral language 
structure were significantly different beyond the .01 level in the 
expected direction. 

These studies indicate a definite relationship between reading ease 
and the similarity of written material to the reader’s spoken language, 
and suggest the importance of a further understanding of this relationship. 
For children who are having difficulties in learning to read, perhaps the 
best kind of remedial materials would be those which use only patterns 
of sentence structure which the child himself uses. Then only the act of 
reading and not the structure and sound of the material would be new to 
the students. 

The second concept of "appropriateness”, the similarity of the readers 
to correct adult speech, concerns additional goals of the primary reading 
program. Once the basic mechanics of reading have been established through 
material incorporating only the child’s language patterns, sentence patterns 
which are more complicated and more like those of adult speech could be 
introduced. Expanding the sentence structures of the readers in some 
systematic way would familiarize the child with adult speech and presumably 
develop his own oral language patterns. 

Much further study is' obviously required to explore these ideas, but 
certainly a first step is a complete analysis of the sentence patterns in 
reading material presently being used. Strickland’s analysis was partial 
in that she surveyed only parts of the readers and was interested only in 
the presence or absence of sentence types but not their frequencies of 
occurrence. Ruddell’ s (1964) study indicates the need for consideration 
of type frequency in determining similarity. A concise and complete 
representation of sentence patterns and their relative frequencies in grade 
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school readers is needed to begin further work in comparisons of oral 
language patterns, reading material, and reading ability. 

This suggests the linguists' approach to syntactical analyses 
through the construction of representative grammars. Several types of 
grammars are possible, for example, phrase- structure grammars, categorial 
grammars, and case grammars. And within each of these, a great number of 
versions could be written. The problem of finding the similarity of the 
sentence patterns in the readers to the sentence patterns of the child's 
speech requires not merely a grammar, but the best grammar in the sense 
that it most accurately represents the utterances in the corpus and their 
frequencies of occurrence. The problem of evaluating several grammars 
written for the same corpus has not been adequately solved. Traditional 
criteria for a "good" grammar are: the grammar generates all of the 

grammatical sentences and none of the ungrammatical ones, the grammar is 
in some sense simple, and the grammar is finite. (Chomsky, 1957) A 
grammar which satisfies these criteria but is clearly unsatisfactory is 
the following: 

Let N be the number of utterances in the corpus, and 
let a , a p , ..., a_ be the utterances. Let the 
generative rules of the grammar be: 

S — * Q-, 

S ? a 2 

S -^a^ 

A split-half analysis, in which the rules which generate perfectly one half 
of the corpus (the even numbered utterances, for example) are applied to 
the other half, would obviously show a poor fit. Quantitative methods for 
evaluating grammars are needed, not only for the analysis of primary 
readers, but for all applications of psycholinguistics. 

This dissertation will make contributions to botA areas outlined 
above. It will furnish a thorough syntactical analysis for some primary 
readers, and it will demonstrate a quantitative method for evaluating 
grammars written for a specific corpus. (Suppes, 1970) 
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The primary purpose of this study is to provide a syntactical 
analysis of two widely used first-grade readers, the Scott -Fo re sman series 1 
and the Ginn series. 1 A phrase- structure grammar will he written for the 
entire corpus and a categorial . grammar will he written for certain aspects 
of the corpus for purposes of comparison. The phrase- structure grammar is 
the more natural in that it uses terminology and constructions familiar 
from the study of Latin grammar; however, the categorial' grammar has the 
advantage of being more easily adapted to other corpuses. 

The study will demonstrate a quantitative method of evaluating 
grammars which provides a theoretical framework to account for the 
utterances and their frequencies of occurrence. A preview of the method 
will he given here, and complete details will he presented in the following 
chapters. The first step in the analysis is to obtain an overview of the 
syntactic types which appear in the corpus. This is done by coding each 
utterance as an ordered n-tuple consisting of the part of speech of each 
word in the order of occurrence. The part of speech is determined according 
to rules commonly used for English grammars; possible categories ares 
noun, transitive verb, intransitive verb, adverb, adjective, etc. The 
sentence, "Come and see the new hat.", would be coded as (rV+C+TV-HT+A+N) 
where "IV" stands for intransitive verb, "C" stands for conjunction, 

"TV" stands for transitive verb, "T" stands for article, "A" stands for 
adjective, and "N" stands for noun. The observed frequency of each 
syntactical type is then determined. 

A grammar is written which generates these types, and the quantitative 
evaluation of the grammar begins with the assignment of parameters to each 
of the choice points. For an example of "choice point" consider the 
following rewrite rule of a generative grammar: S-> G+TV+(T)+( £a^A$)+N, where 

"G" stands for proper noun, and the other letters are as defined above. 

The parentheses indicate that the contents may be deleted, and the brackets 



These texts are listed in the references according to the authors: 
Ousley and Bussell (l957j 196l), Robinson, Monroe, and Artley ( 1962a, 1962b, 
1962c, I962d), and Russell and Ousley (1957)* Throughout this paper these 
texts will be referred to by their reading level and publisher; this is the 
clearest and most widely used procedure. 
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indicate tha*c only one of the members may be used. The choices of this 
rewrite rule are 2 to include or delete the article, and to use zero, one, 
or two adjectives. These are referred to as "choice points", and each is 
assigned a parameter denoting the probability of that choice 0 

The parameters can then be used to state the theoretical probability 
of any syntactical type derivable from the grammar . The probability of 
type (G+TV+T+A+N) , as in "Tom hit the red ball.", is the probability of 
choosing "T" times the probability of choosing "A" times the probability of 
Selecting the particular rewrite rule given above if more than one is 
allowed by the grammar. If optional transformations are included in the 
grammar, parameters must be assigned to the options and must be 
incorporated into the theoretical probabilities. Obligatory transformations 
require no parameters. 

Using the theoretical probabilities and the observed type frequencies, 
the parameters can be estimated using the method of maximum- likelihood. 

This method is workable as long as the grammar is unambiguous; that is, as 
long as each syntactical type can be generated in only one way. If more 
than one derivation is possible for a syntactical type, the theoretical 
probability of that type consists of a sum of products and the maximum- 
likelihood calculations are extremely difficult. 

Once the parameters have been estimated, the estimated frequencies 
of each syntactical type can be calculated. Chi-square tests can then be 
used to determine the goodness-of-f it of the model (the grammar) to the 
observed frequencies of the syntactical types. 

Chapter II will give the details of all steps in the analysis 
preceeding the construction of the final grammars; Chapter III will 
present the phrase- structure grammars and their statistical analyses; 

Chapter IV will present the. categorial grammars and their analyses; and 
Chapter V contains a summary and some concluding comments. 
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CHAPTER II 

PRELIMINARY STEPS OF ANALYSIS 

The corpus for this study is the Ginn first-grade reading series 
and the Scott-Foresman first-grade reading series. These texts are widely 
used and appeared to be similar in terms of format, content, and sentence 
structure. The hope was that the data from the two series would combine 
to form large frequencies for all sentence types; although, as will be 
seen later, this did not prove to be precisely the case. Each first 
grade series is divided into a set of pie-primers, a set of primers, and 
a set of first readers \«hich are read by all students; all of these have 
been included in the analysis. Optional texts such as the Scott-Foresman 
Guess Who for slower learners were not analyzed. 

The texts are divided into short stories consisting mainly of 
conversations between persons or animals, so many sentences contain an 
identification of the speaker. Typical examples are: ,f,f I like this 

kitten, 11 said Betty . ,f and ,f Susan said, ,f Toy Mouse warts a ride."" Neither 
the titles of the stories nor the speaker identifiers (Betty said, he said, 
etc.) were included in the analysis. The titles consisted mainly of short 
noun phrases and were clearly not indicative of the grammar used for 
sentences within the stories. Speaker identifiers, too, had a structure 
of their own, and it seemed appropriate to consider them merely as labels. 
To consider "Susan said, "Toy Mouse wants, a ride."" as the sentence for 
analysis rather than simply "Toy Mouse wants a ride." would distinguish 
sentence types in a way which would not emphasize the most interesting 
characteristics of the grammar. In that case, "Susan said, "Toy Mouse 
wants a ride."", ""Toy Mouse wants a ride," said Susan.", and ""Toy Mouse, 
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said Susan, "wants a ride."" -would have to be considered as separate sentence 
typesj while this is a meaningful distinction, it was not considered 
essential for a first syntactical analysis . 

Structures which -were included in the analysis will from here on he 
called "utterances". An utterance is a string of -words which begins with 
a capital letter and ends with one of the following terminal punctuation 
marks: n V*, or a followed by a speaker identifier and a 

”. ,r . (The symbol used to indicate abbreviations was not considered 
terminal.) Speaker identifiers found at the beginning or the middle of an 
utterance were ignored, and all punctuation was ignored except for the 
purpose of distinguishing utteranceso Thus, the sentence, ""I want it," 
said Bill, "but I can’t eat it/'" includes one utterance: "I want iu but 

I can’t eat it"; whereas the sentences, ""I want it," said Bill. "But 
I can’t eat it."" include two utterances: "I want it" and "But I can’t 

eat it". 

All of the utterances in both first-grade reading series were 
included in the analysis. The number of utterances in each text is shown 
in Table 1, 

Insert Table 1. about here 

The table indicates that the number of utterances in each division 
were roughly the same for each series, although the number of utterances 
in the Ginn series was consistently greater. 

■ The first step in. the analysis was the coding of each utterance 
according to type. Each utterance was coded as an ordered n-tuple 
consisting of the part of speech of each word in the order of occurrence. 

The part of speech was determined according to rules commonly used for 
English grammars. Table 2 lists ail parts of speech found in the corpus 
along, with the abbreviations used in coding (and throughout this paper), 
typical examples, and examples- of any phrases of two or more words which 
were considered to be a unit and coded- as only one part of speech.. 

Insert Table 2 about here 







TABLE 1 



Text 

Ginn 

Scott-Fore sman 
Totals 



Utterance Count 



Pre- Primer 


Primer 


Reader 


Total 


CO 

LT\ 

CO 


1343 


1925 


4126 


638 


io4o 


1450 


3128 


1496 


2383 


3375 


7254 



TABLE 2 



Parts of Speech in Corpus 



O 

ERIC 



Part of 



Speech 


Abbreviation 


Example 


adjective 


A 


pretty 


advert 


ADV 


fast 


article 


T 


the 


common noun 


N 


house 


conjunction 

copulative 


C 


and 


verb 


cv 


is 


interjection 


I 


oh 


interrogative 


adjective 


IADJ 


which 


interrogative 


adverb 


IADV 


how 


interrogative 
pronoun, 
objective . 


case 


IP(2) 


whom 


interrogative 

pronoun, 

subjective 


case 


IP(1) 


who 


intransitive 


verb 


IV 


go 


locative 


L 


here 


modal 


M 


can 


negation 


- 


not 


number used 


in counting 


NBR 


one 


preposition 


J 


into 


pronoun, 

objective 


case 


P<2) 


him 


pronoun, 

subjective 


case 


P(l) 


he 


proper noun 


G 


Betty 


rejoinder 


R 


yes 


relative 

pronoun, 


objective case 


RP(2) 


whom 


relative 

pronoun, 


subjective case 


RP(1) 


who 


salutation 


S 


hello 


sound 


z ... 


zoom 


subordinate 


conjunction 


CON 


, that 


"to" used with 


infinitives 


0 


to 


transitive verb 


TV ’ 


want 









Examples of Types 
Containing more than 1 Word 



ice cream 



Mr. Green, Frisky Kitten 
• all right, thank you 



Good day, Happy birthday 



Me. gaeea. Frisky Kitten 




Clearly a given word could be assigned to different parts of speech 
depending on its usage in a particular utterance « In "I saw the boy who 
hit Mary." "who” is a relative pronoun, whereas in "Who is he?" "who" is 
an interrogative pronoun. Similarly the coding of phrases such as "Frisky 
Kitten" depended on usage. The phrase, "Frisky Kitten" was usually used 
as the name of a cat in which case the phrase was coded as a proper noun 
(G, not G+G); however, in the sentence, "My pet is a frisky kitten." 

"Frisky Kitten" would be coded as an adjective followed by a common noun. 
Table 3 shows some coded types to illustrate the coding method. 

Insert Table 3 about here 

A word must be said about the classification of verbs. Verbs were 
put into one of three categories according to their position in the 
utterance. Verbs which stood between two noun phrases were labeled 
transitive verbs; verbs which were preceded but not followed by a noun 
phrase were labeled intransitive verbs, and verbs which were preceded by a 
noun phrase and followed by an adjective phrase were labeled copulative 
verbs. (Nouns used as adverbs as in "He goes home." were classified as 
such so that in this sentence "goes" is intransitive.) Under this 
classification "to be" verbs can be in any of the three classifications 
depending on usage; the verb is transitive in "He is a very good boy.", 
intransitive in "Sally is here with me.", and copulative in "Your new .dress 
is very pretty." If "to be" verbs had been ^placed in a category of their 
own as is often done, a fourth category would have been required for verbs 
such as ”3.ook" and "get" in "She looks beautiful." and "She is getting more 
beautiful every day." As will be seen, a problem of too many types and 
small frequencies within types was already present and to add a fourth 
category for verbs would only increase this problem. 

The utterances were then sorted according to type., This original 
classification scheme produced a very large number of types with relatively 
small frequencies of occurrence as is illustrated in Tables 4 and 5’ 



Insert Tables 4 and 5 about here 




TABLE 5 

Frequently Occurring Utterance Types 
Coded Type Examples 



(iv+l+k) 


Come here ; Betty. 


(P(l)+TV+T+A+N) 


I like the little chairs. 


(IV+J+P(2)) 


Look at him. 


( IV+C+TV+G) 


Come and see Susan. 


(P(l)+M+IV) 


I can paint. 


(K) 


Betty*. 


(G+CV+A) 


Jane is pretty. 


(z) 


Bow-wow a 


(I+K) 


Oh; Dickt 


(L+P(l)+IV) 


Here I come. 



O 

ERIC 

hfliflaffBHaaaa 




TABLE 4 

Statistics for Utterance Types under Original Classification 



Text 


Total Number 
of Types 


$ of Types with 
Frequency > 5 


# of Types with 
Frequency > 1 


Ginn Pre-Primer 


250 


10.4 


45.2 


Scott-Fore sman 
Pre-Primer 


277 


8.3 


28.9 


Pre-Primers Combined 


454 


10.1 


39-9 


Ginn Primer 


664 


5.0 


28.8 


Scott-Fore sman Primer 


666 


3-3 


19.1 


Primers Combined 


1185 


4.6 


26.3 


Ginn Reader 


1099 


3-3 


25.8 


Scott-Fore sman Reader 


1096 


1.6 


12.8 


Readers Combined 


2015 


2.9 


21.3 




TABLE 5 



Distribution of Utterance Types under Original Classification 



Number of Types with Stated Frequency Range 



Frequency 


Pre-Primers Combined 


Primers Combined 


Readers Combined 


131“ l40 


1 


0 


0 


121-130 


0 


0 


0 


111-120 


0 


0 


0 


101-110 


0 


1 


0 


91-100 


0 


0 


1 


81-90 


1 


0 


0 


71-80 


0 


0 


0 


61-70 


0 


0 


1 


51-60 


0 


0 


0 


iH-50 


1 


0 


1 


31-i+0 


0 


2 


1 


21-30 


6 


7 


k 


11-20 


16 


20 


15 


1-10 


429 


1155 


1992 



O 




. : ‘ i 
* $ 

V . ’ ' 

Table 4 shows that contrary to expectation the Ginn and Scott-Fore sman 
series do not in general contain the same utterance types. The Ginn reader, 
for example, contains 1099 utterance types and the Scott-Fore sman' reader 
contains 1096 types; thus if the readers had no types :Ln common, the 
readers combined would contain 2195 types. In fact, the readers combined 
contain 2015 types indicating that only l8o or about 1 6$ of the utterance 
types in each reader are common to both readers. Further study indicates 
that the types common to both readers are those with the greatest 
frequencies in the individual readers, and that the types with small 
frequencies in one reader do not usually find a match in the other reader. 

It is interesting to note that the number of utterance types in each 
section of each series is nearly the same; the number of types in the two 
pre-primers differs by 27 and the number of types in the primers and 
readers differs by 2 and 5 respectively. In each case the Scott-Foresman 
series contains the greater number of utterances types which was unexpected 
because the Ginn series contains the greater number of utterances. 

(Table l) 

To construct a grammar for the corpus and apply the chi-square 
statistic as indicated in Chapter I (this use of the chi-square will be 
presented in detail in Chapter III), a smaller number of utterance types 
is desirable and greater frequencies within types is required. Elimination 
of all types with frequency less than or equal to five, for example, was 
not feasible at this point because only a small percent of the corpus 
would then be described by the grammar (55° 5$ of the pre-primers combined, 
33*5^ of the primers combined, and 24*3$ of the readers combined). In 
order to obtain a smaller number of types and greater frequencies within 
types, a method for meaningfully collapsing the types was developed. This 
involved the combining of certain groups of letters (parts of speech) 
into one category whenever they occurred. One grammar would then be 
.written to represent each set of rules for combination, and another grammar 
would be written for the utterance types containing the collapsed 
categories. 

v The first attempt, involved the use of noun phrases and verb phrases. 
Pronouns and strings of articles, adjectives, and nouns were replaced by 
ff NP ,f for "noun phrase", and strings of models,, negatives, and verbs were 
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replaced by "VP" for "verb phrase." For example the sentences, "Betty doe 
not want the pretty red ball," of type (G+M+--KTV+T+A+A+N) and "l.iat puppy 
likes me." of type (A+N-KFV+P(2)) would both be of type (NP+VP+NP) under 
noun- phrase and verb-phrase collapsing. As Table 6 shows, this made a 
substantial reduction in the number of types, but a large number of types 
•with low frequencies remained* 

Insert Table 6 about here 

The final collapsing involves the use of a category called "verbal 
modifiers" (VM) as well as noun phrases and verb phrases* In the verbal 
modifier category are strings of adverbs, prepositional phrases, locatives 
and noun phrases used as adverbs. The utterances, "He runs fast." 

( P( l) +IV+ADV ) , "He runs very fast*" (P(l)+iy+ADV+AI)7) , and "He runs to the 
house." (P(l)+IV+J+T+N) , would all be of type (NP+VP+VM) under the final 
collapsing. Complete details of the noun- phrase, verb-phrase, and verbal- 
modifier classification systems will be given in Chpater III when the 
phrase- structure grammars are presented. 

As Table 6 shows, this final collapsing greatly reduced the number 
of types compared to the first two classification systems. While the 
percent of types with frequencies greater than five remains low, the 
percent of the corpus which can now be accounted for by types with 
frequencies greater than five has substantially increased. Eighty- four 
percent of the corpus can now be described by types with frequency greater 
than five. 

Tables 7 and 8 show further details of the number and frequencies 
of types which have been formed using the noun- phrase, verb-phrase, and 
verbal-modifier categories. 

Insert Tables 7 and 8 about here 

Table 7 indicates that in general the trends apparent under the 
original classification system (Table 4 ) have continued. Each section of 
the Scott-Fore sman series contains a greater number of types but a smaller 
percent of types with frequency greater than five (and greater than l) 
than the corresponding Ginn section. This implies that the Scott -Fore sman 
books contain a greater variety of sentence types than the Ginn books 
and in this sense are more difficult. The total number of types for the 
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TABLE 7 

Statistics for Utterance Types under 
Final Classification 

Total Number $ of Types with $ of Types with 



Text 


of Types 


Frequency > 5 


Frequency > 1 


Ginn. Pre-Primer 


90 


30.0 


56.7 


Scott-Foresman 

Pre-Primer 


116 


16.4 


54.3 


Pre-Primers Combined 


153 


24.2 


57.5 


Ginn Primer 


204 


23-5 


49.0 


Scott-Foresman Primer 


262 


12.6 


40.5 


Primers Combined 


364 


17-3 


46.2 


Ginn Reader 


325 


14.8 


44.3 


Scott-Foresman 

Reader 


511 


9.2 


28.2 


Readers Combined 


689 


11.6 


35-3 


All Combined 


882 


14.7 


42.0 





/ 0?ABLE 8 

Distribution of Utterance Types under Pinal Classification 



Frequency 

621-630 

561-570 

341-350 

• 

• 

311-320 

301-310 

' • 

251-260 

241-250 

231-240 

• 

211-220 

« 

191-200 

181-190 

171-180 

161-170 

151-160 

141-150 

151-140 

121-150 

111-120 

101-110 

91-100 

81-90 

71-80 

61-70 

51-60 

41-50 

51-40 

21-50 

11-20 

1-10 



•e-Primers 

Combined 



0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

1 

2 

2 

0 

1 

2 

2 

6 

9 

126 



Primers 

Combined 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

2 

1 

0 

2 

1 

1 

3 

7 

5 

20 

520 



Readers 

Combined 

0 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

1 

1 

1 

3 

3 

2 

8 

3 

19 

644 



All 

Combined 

1 

1 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

2 

1 

1 

2 

2 

0 

2 

2 

1 

1 

2 

1 

3 

10 

16 

29 

800 



combined sections still indicates a large number of types present in 
only one reader , but the percent of common types is now much greater than 
in the previous classification systems. Again, it is generally the small 
frequency types which are present in only one series. 
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CHAPTER III 

PHRASE- STRUCTURE GRAMMARS 

In this chapter the phrase- structure grammars and their corresponding 
statistical analyses will he presented. Basically , a phrase- structure 
grammar is concerned with the division of sentences into their constituent 
parts--into phrases, subphrases, and finally into word categories. It 
consists of a set of rewrite rules from which, ideally, all of the grammatical 
types and none of the ungrammatical types can be generated. 

For this corpus separate grammars, have been written for the noun 
phrases, the verb phrases, and the verbal modifiers, and these specific 
grammars have been utilized in the grammar for the complete utterances. 

The grammar for utterances will generate such types as (NP+VP+NP) , but 
will not generate such types as (T+A+N+TV+A+A+N) . To generate a form of 
the latter type, first the utterance grammar , and then the noun- phrase, 
verb-phrase, and verbal-modifier grammars must be used. The reason for 
the construction of several smaller grammars instead of one large grammar 
concerns the large number of sentence types and the extremely low frequencies 
of occurrence within each type which one large grammar would have to 
generate. This has been more fully explained in the second chapter. 

The grammar for complete utterances has been written as three 
separate grammars, a grammar for statements with verbs, a grammar for 
statements without verbs, and a grammar for interrogatives. Originally a 
fourth gr amm ar was planned--one for compound sentences } however, this type 
of utterance comprised only 1.1$ of the corpus, and the type frequencies 
were too low for statistical analysis. No grammar was written, but further 
details of the compound utterances will be given later in this chapter. 



i Vi ; : 1 . ' ' 

• 22 
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A total of six grammars, then, have been constructed for the corpus; 
the number of phrases involved of each of these and for the compound 
utterances has been summarized in Table 9 • 

Insert Table 9 about here 




The noun-phrase grammar will be presented first and will be used to 
illustrate a phrase- structure grammar as well as to demonstrate the details 
of the statistical analysis including the assignment of parameters, the 
determination of the theoretical probabilities, the maximum- likelihood 
function, and the calculation of the chi-square values. This form of 
analysis was followed for all grammars, and will not again be reported in 
such detail. 

Noun-Phrase Grammar 

The rewrite rules of the noun- phrase grammar are shown in Table 10. 
The complete noun- phrase grammar includes some transformations, but these 
will be explained later. 



Insert Table 10 about here 

The symbols used in Table 10 (and throughout this paper) have the 
meanings commonly assigned to them by linguists: 

-» : rewrite as 

\\ : choose at most one from within 

( ): may delete contents 

The meanings of the abbreviations may be found in Table 2. (Chapter II) 

Table 11 shows the parameters which have been assigned to the choice 
points of the grammar. Throughout this paper variables which must sum to 
one have the same name and different subscripts. Thus in the noun-phrase 
grammar, A^+A^+A^ 

will be used for parameters because many of the tables are in computer 
output off of Model- 33 teletypes which do not have the more traditional 
small Greek letters. 

Insert Table 11 about here 

The parameters incidated in Table 11 represent the following 
probabilities: 



+ \ = 1, bi 1 +bi 2 = 



1 , 



and 32 ^ 32 ^ 32 ^ = 



Capital letters 
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TABLE 10 



Rewrite Rules of the Noun-Phrase Grammar 



NP -» P 
NP -* G 

NP -» (T) + (£ a + a $) + N 
NP A+ A +A +N 



TABLE 11 



Rewrite Rules and Parameters 
of the Noun- Phrase Grammar 



Rewrite Rules 



NP -» P 



Parameters 

Within- 

Rule-Choice Rule-Choice 
Probabilities Probabilities 

A, 



NP -> G 

NP -» (T) + (Ltj) + N 



l A+A 

NP -» A + A +A+N 



B1 3 

Bl. 



B2q 

B2 



O 

ERIC 

imimffamiaaa 



A^: Probability of choosing the first noun-phrase rule 

A^: Probability of choosing the second noun-phrase rule 



A_: Probability of choosing the third noun-phrase rule 

5 * 

A^: Probability of choosing the fourth noun-phrase rule 

Bl^: Probability of choosing "T” 

Bl^: Probability of deleting "T" 

B2^: Probability of choosing "A" 

B2 2 : Probability of choosing "A+A" 

B2,: Probability of deleting either choice 

5 



Note that while nine parameters have been indicated for this model, 
only six of the parameters are free to vary; the remaining three will be 
determined because of the three sets of parameters which must sum to one. 

The theoretical probabilities of each string derivable from the grammar 
can now be determined according to the generation rules; the usage of the 
generation rules is best illustrated by a phrase-marker. For ex/ample, 
consider the string, (T+A+N). 



The theoretical probability of any grammatical type is the product 
of the choice probabilities; in this case the theoretical probability is: 



Note that for this grammar and all other grammars, the generation ends 
with a non-terminal vocabulary of words describing parts of speech rather 
than a terminal vocabulary of words such as: "the", "girl", "pretty", etc. 

In this way the theoretical probabilities of all strings derivable 
from the grammar can be determined. These are presented in Table 12. 

Insert Table 12 about here 

As Table 12 shows, nine types of noun phrases are derivable from the 
model and six free parameters are used in the theoretical probabilities. 
Thus the original model allows two degrees of freedom. The number of 
degrees of freedom for this grammar and for the following grammars is 
sometimes decreased because of low predicted frequencies; whenever the 



Phrase -Marker 



Choi ce Probability s 



NP 

* 

T+A+N 




1) A 3 -B1 1 .B2 1 




TABLE 12 



Types Derivable from the Noun-Phrase Grammar 
and their Theoretical Probabilities 

Type Theoretical Probability 



P 

G 

N 

A+N 



A+A+N 

T+N 

T+A+N 

T+A+A+N 

A+A+A+N 
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predicted frequency of a type was less than five, the observed and 
predicted frequencies of that type were combined with those of the 
following types until the total predicted frequency was greater than five. 

One degree of freedom was lost each time one cell was combined with another. 

This will be illustrated when the statistics for the noun-phrase grammar 
are presented. 

The parameters were estimated according to the method of maximum- 
likelihood. This method is easily applicable as long as the grammar is 
unambiguous; that is, as long as each syntactical type can be generated 
in only one way. This can be seen to be the case for the noun-phrase 
grammar and for all the remaining grammars with the exception of one 
utterance type in the grammar for interrogatives. When more than one 
derivation for a given syntactic type is possible, the maximum- likelihood 
method is still workable, but the calculations are much more difficult. 

The theoretical probability for a type with more than one derivation 
consists of a sum of terms similar to the probability given on line 1, 
and this greatly increases the complexity of the likelihood function. 

The likelihood function consists of a product of N terms where N is 
the number of different grammatical types derivable from the grammar. 

For the noun phrase grammar N is nine. Each term is the theoretical 
probability of one of the syntactical types raised tS the power which is 
the number of occurrences of that type. For the noun phrases the likelihood 
function is: 

2) (Ap^ • (A 2 )% • (A^ “Bl 2 • B2,)% • (A 5 “Bl 2 .B2 1 ) f A+N « (A^ELg »B2 2 ) f A+A+N ■ 
(A 5 »B1 1 “B2 5 )%+W . (A 5 »B1 1 “B2 1 )%I-A+N • {A^Bl^m^T+A+A+H ■ 
(A^) f A+A+A+N 

where "f" stands for observed frequency and the type is indicated as a 
subscript. Note that in the case of a grammar which generates an infinite 
number of types, the likelihood function is truncated in terms of frequencies. 
The likelihood function does not reflect any decision about the existence 
of a fixed finite number of syntactical types in the corpusj it is only a 
decision relative to frequencies in the corpus. 

The maximum- likelihood estimates were then substituted into the 
theoretical probabilities to obtain numerical theoretical probabilities, 



and the products of these with the total number of phrases give the 
theoretical frequencies. Chi-square values were then computed to determine 
the goodness-of-f it of the predicted frequencies to the observed frequencies. 

To complete the noun-phrase grammar some obligatory transformations 
were necessary. Such transformations are applied after all applicable 
generation rules have been used and are used to handle cases in which 
under certain conditions exceptions from the generative rules always occur. 
For the noun- phrase grammar the obligatory transformations are: 

a) If "N" is " something", A+N -» N+A. 

b) If "A" is "what" or "all", T+A+N. -> A+T+N. 

c) If "A 1 " is "what" or "all", T+A^Ag+N -* A^+T+A^N, 

Transformation (a) is necessary to transform a phrase like "red 

something" (A+N) into "something red" (N+A) ; transformations (b) and (c) 
change phrases like "the all children" (T+A+N) and "a what big ball" 

(T+A+A+N) to "all the children" (A+T+N) and "what a big ball" (A+T+A+N) . 

To generate a string given by a transformation, the string to which 
the transformation is applied is derived according to the generative rules. 
Then, if the terminal vocabulary selected for the non-terminal string 
satisfies the assumptions of the transformation, the transformation is 
automatically applied. This means that for purposes of analysis, 
instances of strings which are generated by obligatory transformations 
are counted as occurrences of the corresponding strings before the 
transformation. ■ Thus the observed frequency of the grammatical type (A+N) 
is the sum of the frequencies of .type (A+N) and type (N+A) where "N" was 
chosen to be "something." Obligatory transformations do not effect the 
number of parameters or the theoretical probabilities. 

Optional transformations are also possible, and these do effect 
the number of parameters and the theoretical probabilities. The possible 
need for optional transformations is apparent in grammars for complete 
sentences; for example, one might want to change "I am coming." to "I come." 
some of the time, but not always. This choice could be made available by 
an optional transformation. Optional transformations require- a parameter 
for each option, and these parameters become terns in the products 
representing the theoretical probabilities. The parameter estimation and 



the remaining statistical analyses are the same whether or not optional 
transformations are used* Optional transformations were not necessary 
for the noun- phrase grammar or for any of the phrase- structure grammars » 

The categorial grammar for noun phrases, given in the fourth chapter, 
does use an optional transformation. 

For the noun-phrase grammar the analysis described above was run on 
the individual sections of the corpus — the Ginn pre-primer, the Scott - 
Foresman pre-primer, the pre-primers combined, the Ginn primer, the Scott - 
Foresman primer, the primers combined, the Ginn (first) reader, the Scott - 
Foresman reader, and the readers combined — as well as on the entire corpus. 
This was true of the other grammars whenever the type frequencies were 
sufficiently large. Tables 13-l6 summarize the results of the noun- 
phrase analysis. Table 13 shows the percent of each section of the 
corpus accounted for by the grammar (that is, the percent of utterances 
in the corpus whose syntactical types are derivable from the grammar); 

Table ik gives the maximum- likelihood estimates; Table 15 provides a 
comparison of the total chi-squares for each section; and Table 1 6 shows 
for each section the observed and theoretical frequencies and corresponding 
chi- square contributions of each type. All computations for Tables l^-l 6 
and other tables of a similar nature were carried’ out with five decimal 
digits; the resulting values were rounded off at the time of output to 
those shown in the tables. 

Insert Tables 13-16 about here 

The second page of Table 16 , the page for the Scott-Fore sman pre-primer, 
illustrates all of the notation used in this table and in the corresponding 
tables for the remaining grammars. The source designated "Expected Freq. 

Less Than 5«0" is the total from categories which have been combined because 
of low expected frequencies; it occurs whenever the expected frequency has 
accumulated to 5*0. For the Scott-Fore sman pre- primer, the observed and 
expected values of this source are the sums of the corresponding values 
of types (A+A+N) and (T+A+A+N); the chi-square is obtained from the total 
values. Here two categories were combined into one, and so one degree 
of freedom was lost at this point. A second degree of freedom was lost 
because for the last type, (A+A+A+N), the expected frequency was again less 
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TABLE 15 



r. 


Comparison of 


Total Chi-Squares 


for Noun-Phrase 


Grammar 


7" 




No. of Phrases 


Total 


Degrees 


li 


Text 


Accounted For 


Chi-Square 


of Freedom 


17 


Ginn Pre-Primer 


772 


.5 


1 


!i 


Sc ott-Pore sman 
Pre- Primer 


672 


2.3 


0 


li 


Pre-Primers Combined 


1444 


1.3 


1 


IT 


Ginn Primer 


1664 


52.2 


2 


li 


Scott-Fore sman Primer 


1482 


42.6 


1 


f i 


Primers Combined 


3146 


95-3 


2 




Ginn Reader 


3081 


81.5 


2 


11 


Scott-Fore sman Reader 


2651 


177.5 


2 


17 


Readers Combined 


5732 


247.6 


2 


li 


All Combined 


10322 


316.8 


2 



0 

1 ] 




TABLE 1 6 



Observed and Expected Frequencies, Chi-Square Contributions, and Total 
Chi-Squares for each Section of the Corpus for Noun-Phrase Grammar 



GINN PRE- PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


230 


230.0 


.0 


P 


254 


254.0 


.0 


G 


76 


73.4 


.1 


N 


23 


24.2 


.1 


A+N 


8 


9.3 


.2 


A+A+N 


121 


123.6 


.1 


T+N 


42 


40.8 


.0 


T+A+N 


17 


15.7 


.1 


T+A+A+N 


1 


1.0 




A+ A+A+N 


1 


1.0 




RESIDUAL 


772 


772.0 


.5 


TOTAL 



1 DEGREES OF FREEDOM 



SCOTT-FORESMAN ERE- PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


424 


424.0 


.0 


P 


132 


132.0 


.0 


G 


33 


36.3 


.3 


N 


24 


20.0 


.8 


A+N 


4 


4.7 




A+A+N 


36 


32.7 


.3 


T+N 


14 


18.0 


.9 


T+A+N 


5 


4.3 




T+A+A+N 


9 


9.0 


.0 


EXPECTED FREQ. LESS THAN 5 


0 


.0 




A+ A+A+N 


0 


.0 




RESIDUAL 


672 


672.0 


2.3 


TOTAL 






0 


degrees OF FREEDOM 



O 
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TABLE 1 6 (continued) 



f: 



PRE-PRIMERS COMBINED 



0 

0 

0 

0 

i 

0 

D 

G 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


654 


654.0 


.0 


P 


386 


386.0 


.0 


G 


109 


110.9 


.0 


N 


47 


42.9 


.4 


A+N 


12 


14.2 


.3 


A+A+N 


157 


155.1 


.0 


T+N 


56 


60.1 


.3 


T+A+N 


22 


19.8 


.2 


T+A+A+N 


1 


1.0 




A+A+A+N 


1 


1.0 




RESIDUAL 


1444 


1444.0 


1.3 


TOTAL 






1 


DEGREES OF FREEDOM 





GINN 


PRIMER 






OBSERV. 


EXPECT. 


CHI**2 


SOURCE 




727 


727.0 


.0 


P 




286 


286.0 


.0 


G 




86 


130.8 


15.4 


N 




123 


92.8 


9.8 


A+N 




56 


41 .4 


5.2 


A+A+N 




227 


182.2 


11.0 


T+N 




99 


129.2 


7.1 


T+A+N 




43 


57.6 


3.7 


T+A+A+N 




17 


17.0 


.0 


A+A+A+N 




0 


.0 




RESIDUAL 




1664 


1664.0 


52.2 


TOTAL 





e 



2 DEGREES OF FREEDOM 




] 



\ : i ■ 
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TABLE 1 6 (continued) 



SCOTT-FORESMAN PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


784 


784.0 


.0 


P 


220 


220.0 


.0 


G 


€3 


98.2 


12.6 


N 


110 


85.5 


7.0 


A+N 


43 


32.3 


3.6 


A+A+N 


153 


117,8 


10.5 


T+N 


78 


102,5 


5.9 


T+A+N 


28 


38.7 


3.0 


T+A+A+N 


3 


3.0 




A+ A+A+N 


3 


3.0 




RESIDUAL 


1482 


1482.0 


42.6 


TOTAL 



1 DEGREES OF FREEDOM 



PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


1511 


1511.0 


.0 


P 


506 


506.0 


.0 


G 


149 


229.4 


28.2 


N 


233 


177.8 


17.1 


A+N 


99 


73.7 


8.7 


A+A+N 


380 


299.6 


21.6 


T+N 


177 


232.2 


13.1 


T+A+N 


71 


96.3 


6.6 


T+A+A+N 


20 


20.0 


.0 


A+A+A+N 


0 


.0 




RESIDUAL 


3146 


3146.0 


95.3 


TOTAL 



2 DEGREES OF FREEDOM 




TABLE 16 (continued) 



I 

0 



n 



GINN READER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


1143 


1143.0 


.0 


P 


473 


473.0 


.0 


G 


215 


297.5 


22.9 


N 


238 


185.6 


14.8 


A+N 


103 


72.9 


12.4 


A+A+N 


556 


473.5 


14.4 


T+N 


243 


295.4 


9.3 


T+A+N 


86 


116.1 


7.8 


T+A+A+N 


24 


24.0 


.0 


A+A+A+ N 


0 


.0 




RESIDUAL 


3081 


3081 .0 


81.5 


TOTAL 



2 DEGREES OF FREEDOM 



D 



i 



o 




o 



SCOTO-FORESMAN READER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


997 


997.0 


.0 


P 


323 


323.0 


.0 


G 


239 


359.2 


40.2 


N 


370 


278.1 


30.4 


A+N 


97 


68.7 


11.6 


A+A+N 


430 


309.8 


46.6 


T+N 


148 


239.9 


35.2 


T+A+N 


31 


59.3 


13.5 


T+A+A+N 


16 


16.0 


.0 


A+ A+A+N 


0 


-.0 




RESIDUAL 


2651 


2651.0 


177.5 


TOTAL 



ii 




tRIC 



2 



DEGREES OF FREEDOM 
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L 6 (continued) 



READERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


2140 


2140.0 


.0 


P 


796 


796.0 


.0 


G 


454 


659.4 


64.0 


N 


608 


457.5 


49.5 


A+N 


200 


145.2 


20.7 


A+A+N 


986 


780.6 


54.0 


T+N 


391 


541.5 


41.9 


T+A+N 


117 


171.8 


17.5 


T+A+A+N 


40 


40.0 


.0 


A+ A+A+N 


0 


.0 




RESIDUAL 


5732 


5732.0 


247.6 


TOTAL 






2 


DEGREES OF FREEDOM 



AIL COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


4305 


4305.0 


.0 


P 


1688 


1688.0 


.0 


G 


712 


1000.7 


83.3 


N 


888 


677.0 


65.8 


A+N 


311 


233.3 


25.9 


A+A+N 


1523 


1 2o4 .3 


67.5 


T+N 


624 


835.0 


53.3 


T+A+N 


210 


287.7 


21.0 


T+A+A+N 


61 


61.0 


.0 


A+A+A+N 


0 


* e I 




RESIDUAL 


10322 


10322 


316.8 


TOTAL 






2 


DEGREES OF FREEDOM 
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than five. The line labeled "Residual" contains types such as (A+A+A+N) 
whose individual predicted frequencies were less than 5 and whose total 
predicted frequency was less than that is, the residual contains all 
types whose predicted frequencies were less than 5 and which fell below the 
last "Expected Freq. Less Than 5*0" line. The residual also contains 
round-off errors (causing some negative values) and the values (observed 
frequency, predicted frequency, and chi-square contribution) for unlisted 
types. Unlisted types are those generated by the grammar but not included 
separately in the source column. The phrase- structure grammars generate a 
finite number of types and these are all listed so there will never be 
unlisted types; the verb-phrase categorial grammar, however, generates 
an infinite number of types and the values of those with non-zero 
probabilities will be included in the residual. When the expected frequency 
of the residual is greater than 5 (possible because of unlisted types), 
the chi-square is evaluated and one degree of freedom is added; otherwise 
it is not. Due to the two degrees of freedom lost through collapsing, 
the noun-phrase grammar for the Scott-Fore sman pre-primer has zero degrees 
of freedom. The chi-square for this part of the corpus must be studied 
for descriptive purposes only and must not be used to determine significance 
levels. 

As Table 13 indicates, the noun-phrase grammar accounts for a very 
high percent ( 98 . 2 $) of the noun phrases in the entire corpus, and for a 
nearly equivalent percent of each section of the corpus. It does not 
generate any types which do not appear in the corpus; in fact, all derivable 
types occur at least 50 times in the corpus as a whole. Thus according 
to the criteria of generating all types appearing in the corpus, and only 
those, this grammar is very good. 

The chi-square tables (Tables 15 and 16 ) indicate the "goodness" 
of the grammar with regard to type frequency. In discussions of the chi- 
square values the term "comparative chi-square value" will be used to 
indicate that any differences in degrees of freedom have been considered 
and the effects incorporated into the comparisons. When differences in 
degrees of freedom made comparisons difficult, an F-test of significance 
was used. For the noun- phrase grammar the comparative chi-square values 
increase significantly form the pre-primers to the primers and from the 
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primers to the readers, indicating that with regard to frequency the grammar 
is much more representative of the pre-primers than the primers and readers. 

For the pre-primers and primers the comparative chi-squares for the Ginn and 
Scott-Fore sman hooks are roughly equivalent, hut for the readers the Scott- 
Foresman chi-square is considerably greater. With respect to type frequency, 
the grammar is a much better representation of the Ginn reader than the 
Bcott -Fore sman reader » 

With the exception of the pre-primers the comparative chi-square 
values are higher for the combined volumes than for either volume separately, 
and the comparative chi-square for the total corpus is the largest of all. 

This trend indicates the difference in noun-phrase type- frequency patterns 
:for the two reading series and for the different sections within each 
series. The difference is apparent in the observed frequencies, and is 
most obvious in the readers. The Scott-Foresman reader, for example, has 
more phrases of type (N) than of any other type containing “H 11 , while in 
the Ginn reader phrases of types (A+N), (T+N), and (T+A+N) all have greater 
frequencies than phrases of type (W) . 

Table 1 6 gives further details regarding the chi-square values. The 
first, second, and last noun phrase types, (P), (G), and (A+A+A+N), 
contributed nothing to the chi-square value because the theoretical probabilities 
of each of these types consisted of only one parameter. For the pre-primers 
and the pre-primers combined the comparative chi-square values are consistently 
low. For the remaining sections consistently high contributers to the 
total chi-square are the (N), (A+N), and (T+N) types, with predictions 
for type (N) being consistently high and predictions for types .(A+N) and 
(T+N) being consistently low. For the readers individually and combined 
the predictions for the (T+A+N) types are high yielding additionally large 
chi-square contributions.. The table also shows that the great difference 
in total chi-square between the Ginn and Scott-Foresman readers is caused 
by higher contributions for all types in the Scott-Foresman book rather 
than one extremely large contribution from one or a few types. 

Figure 1 shows the fit of the noun- phrase grammar to the entire corpus. 

Th£ observed frequencies were arranged in rank order and plotted accordingly; 
the predicted frequencies were plotted at the rank of their corresponding 
observed value. 
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Insert Figure 1 about here 
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Fig. 1. Comparison of observed and predicted frequencies 
for noun phrases (entire corpus). 
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The figure shows an extremely good fit.. With the exception of type 
(A+N), the predicted rank order is the same as the observed, and the two 
curves are nearly identical » However in view of the large number of parameters 
which have been used and the small number of degrees of freedom, it is not 
surprising that some sort of fit has been obtained* It is hoped that in a 
later stage of investigation the number of parameters can be reduced. 

Verb-Phrase Grammar 

The verb-phrase grammar including the rewrite rules and four 
obligatory transformations is presented in Table 17 • 

Insert Table 17 about here 

The parameter notation used for the noun-phrase grammar has been 
continued for the verb-phrase grammar and will be used throughout this 
chapter. The parameters always denote a rule-choice probability, and 
the Bj^ parameters denote probabilities of choices within a given rule. In 
a column of parameters* the first member indicates the probability of 
the first (the top) choice within the parentheses, the second member the 
second choice, and so on, and the last member of the column indicates the 
probability of choosing nothing from that set of parentheses. In the verb- 
phrase grammar, KLj. is the probability of choosing B2^ is the 

probability of choosing ,f M and Bl_ is the probability of deleting both 

possibilities. Note that the same parameters were assigned to corresponding 
choices for each verb type. There was no a priori reason to think 1 liat 
modals or negatives would occur a greater proportion of the time with 
transitive verbs than with intransitive or copulative verbs. Nine types 
are derivable from the verb'- phrase grammar and four free parameters are 
used; whenever no collapsing of categories (explained in the noun-phrase 
section) is required, the model has three degrees of freedom. 

The verb phrase grammar employs subscripts to denote verb form. When 
a modal is used, the verb must be in the infinitive form, and this is 
denoted by the obligatory assignment of the subscript, 3> to the modal 
and to the verb once a modal has been chosen. If no modal has been chosen, 
the subscript, 3. (indicating infinitive form), may not be assigned to the 
verb, but the verb may be either singular (i=l) or plural (i=2). In the 
utterance grammars subscripts will be assigned to the subjective noun 

O 

ERIC 

hfliflaffHHaiiaa 
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Rewrite Rules 



VP- 



M 3 

M +' 

3 



+IV. 



VP -> 



M 3 

M +- 

3 



+TV. 



VP-> 



M 3 

M +“ 

3 



+cv i 



TABLE 17 

The Verb-Phrase Grammar 

Parameters 

Rule-Choice Within-Rule-Choice 

Probabilities Probabilities 




B1 1 

B1 2 

Bl, 



B1 : 

Bl, 



EL, 



B1 

B1 

B1 



1 

2 

3 



i = 1 => singular verb form 
i = 2 => plural verb form 

i 3 => infinitive verb form <=> M, occurs 



Obligatory transformations: 




1. If M, is a form of "to be", 


IV. 

TV^ 


f 1 


IV,+ing ) 

3 f 

TV,+ing } 

$ 1 


✓ 


CT 3 




CV +ing 1 

3 » 



2a. If IV, is a form of "to be" or "to do" and M is a form of "to be" 

5 / 3 5 

or "to .do", M 5 +-+IV 3 IV ± +-, ify . 

2b. If IV. is "will", "can", "may", "must", "shall", or "could", 
M 5 +-+IV, -» IV^-, i/3. 

3. If TV, is a form of "to be" and M, is a form of "to be" or "to do", 

■ 3 ,3 3 

M-+-+TV, -» TV . +- , idy. 

3 3 i 

4. If CV, is a form of "to be" and M, is a form of "to be" or "to do", 

3 • .3 3 

M +-+CV CV.+-, hfy. 
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phrases and to the verbs so that subjects and predicates will be forced 
to agree in number and person. However no parameters will be assigned to 
the choice of subscripts; that is, types with singular subjects and 
predicates will not be distinguished from the corresponding types with 
plural subjects and predicates. In a complete grammar this distinction 
might be made, but the problem of many types with low frequencies precluded 
doing so at this stage. 

Because forms of the verb "to be" are available choices for "M^", 
transformation 1 is required to change phrases like "is go" and "are come" 
to "is going" and "are coming". Transformations 2-4 were necessary to 
change phrases like "am not being” and "do not be" into "am not" and 
"do not". While English usage does permit the untransformed forms (as in 
"She is not being very nice." and "Do not be that way."), the untransformed 
forms were not used in this corpus. Transformation 2a, involving 
intransitive verbs, changes phrases'- like "does not do" into "does not", 
so utterances like "She does not," are derivable from the utterance grammar; 
however, transformation 3.? involving transitive verbs, does not allow this 
transformation, and so utterances like "He does not do his homework." 
are also derivable from the utterance grammar. 

Wo work with verb tense has been done in this analysis. For the 
most part in this corpus past and future tenses are formed through the use 
of the auxiliaries "did" and "will". Some use of "ed" endings is made, but 
a refinement of the analysis to include tense characteristics at this 
point would only serve to confuse the presentation and statistical analysis 
of the more important structural characteristics. 

Tables l8 through. 21 present the statistical analysis of the verb- 

i 

phrase grammar. On the tables subscripts are shown in parentheses. It 
should be noted that instances of modal plus verb combinations in which 
the modal and verb were necessarily separated (as in interrogatives such as 
"May she come out with us?") were included in the frequency counts of the 
respective modal plus verb types. Some contractions such as "can’t" and 
"don’t" were used; these were considered to be transformations of the 
terminal vocabulary and as such were not distinguished from the corresponding 
uncontracted forms. Similarly, in the grammars for complete utterances, the 
subject- verb contractions such as "I’ll" and the verb-object contractions 
such as ’let’s" were treated as the uncontracted forms. 



Insert Tables 18-21 about here 
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TABLE 20 

Comparison of Total Chi-Squares for the Verb-Phrase Grammar 



Text 


No. of Phrases 
Accounted For 


Total Chi-Square 


Degrees 
of Freedom 


Ginn Pre-Primer 


894 


4.9 


' 2 


Scott -Fore sman 
Pre-Primer 


657 


4l.4 


1 


Pre-Primers Combined 


1551 


55-5 


2 


Ginn Primer 


1591 


54.9 


* 


Scott-Foresman Primer 


1182 


56.9 


3 


Primers Combined 


2575 


69.1 


4 


Ginn Reader 


2012 


72.9 


3 


Scott-Foresman Reader 


1852 


12.9 


3 


Readers Combined 


3844 


57-5 


4 


All Combined 


7968 


159.1 


4 



0 




TABLE 21 



Observed and Expected Frequencies, Chi-Square Contributions, and Total 
Chi-Squares for each Section of the Corpus for the Verb-Fhrase Grammar 



GINN PRE- PRIMER 



03SERV. 


EXPECT. 


CHI**2 


SOURCE 


462 


456.5 


.1 


IV 


34 


40.8 


1.1 


Wf IV 


11 


9.6 


.2 


Mf-+IV 


336 


340.4 


.1 


TV 


38 


30.4 


1.9 


MfTV 


4 


7.2 


1.4 


PH--+TV 


7 


8.1 


.2 


CV 


0 


.7 




WfCV 


2 


.2 




M*-+ev 


2 


.9 




RESIDUAL 


894 


894.0 


4.9 


TOTAL 






2 


DEGREES OF FREEDOM 



SCOTT-FORESMAN PRE-PRIMER 



ERIC 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


325 


290.5 


4.1 


IV 


30 


57.2 


12.9 


PH* IV 


17 


24.3 


2.2 


PH--+IV 


187 


219.4 


4.8 


TV 


68 


43 .2 


14.2 


MfTV 


26 


18.4 


3.1 


PH--+TV 


1 


3.1 




CV 


3 


.6 




WfCV 


0 


.3 




Mf-+CV 


4 


4.0 




RESIDUAL 


657 


657.0 


41.4 


TOTAL 






1 


DEGREES OF FREEDOM 



TABLE 21 (continued) 



ERE-PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


7g7 


747.0 


2.1 


IV 


64 


98.0 


1 1 .8 


Mfl V 


28 


34.0 


1.1 


Mf-+IV 


523 


560.0 


2.4 


TV 


106 


73.5 


14.4 


Mf TV 


30 


25.5 


.8 


Mf -+TV 


8 


11.0 


.8 


CV 


3 


1.5 




Mf CV 


2 


.5 




Mf-+CV 


5 


1 .9 




RESIDUAL 


1551 


1551.0 


33.5 


TOTAL 



2 DEGREES OF FREEDOM 



GINN PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


579 


545.4 


2.1 


IV 


75 


103.8 


8.0 


Mf I V 


37 


41.7 


.5 


Mf-+I V 


455 


494,9 


3.2 


TV 


131 


94.2 


14.4 


Mf TV 


41 


37.9 


.3 


Mf -+T V 


64 


57.6 


.7 


CV 


3 


21.0 


5.8 


Mf CV 


6 


4.4 




Mf-fCV 


6 


4.4 




RESIDUAL 


1391 


1391.0 


34.9 


TOTAL 



3 DEGREES OF FREEDOM 
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TABLE 21 (continued) 



SCOTT-FORESMAN PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


459 


418.2 


4.0 


IV 


53 


69.3 


3.8 


MflV 


27 


51.5 


11.7 


Mf-fIV 


428 


467.8 


3.4 


TV 


97 


77.5 


4.9 


Mf TV 


78 


57.6 


7.2 


Mf-+TV 


30 


31.0 


.0 


CV 


2 


5.! 


1.9 


M+-CV 


8 


3.8 




Mf-+CV 


8 


3.8 




RESIDUAL 


1182 


1 182.0 


36.9 


TOTAL 



3 DEGREES OF FREEDOM 



PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


1038 


963.3 


5.8 


IV 


128 


172.6 


11.5 


MflV 


64 


94.2 


9.7 


Mf -+I V 


883 


963.3 


6.7 


TV 


228 


172.6 


17.8 


MfTV 


119 


94.2 


6.5 


M+-+TV 


94 


88.5 


.3 


CV 


5 


15.9 


7.4 


Mf CV 


14 


8.7 


3.3 


Mf-fCV 


0 


-.0 




RESIDUAL 


2573 


2573.0 


69.1 


TOTAL 






4 


DEGREES OF FREEDOM 



O 

ERIC 



TABLE 21 (continued) 



4B 




GINN HEADER 



09SERV. 


EXPECT. 


CHI**2 


SOURCE 


90! 


832.2 


5.7 


IV 


121 


173.9 


16.1 


WflV 


61 


77.0 


3.3 


PH--+IV 


585 


663.1 


9.2 


TV 


199 


138.5 


26.4 


MfTV 


79 


61.3 


5.1 


Mf -+TV 


60 


50.7 


1.7 


CV 


3 


10.6 


5.4 


MfCV 


3 


4.7 




M4--+CV 


3 


4.7 




RESIDUAL 


2012 


2012.0 


72.9 


TOTAL 



3 DEGREES OF FREEDOM 



SCOTT-FORESMAN READER 



09SERV. 


EXPECT. 


CHI**2 


SOURCE 


755 


729.2 


.9 


IV 


85 


99.3 


2.1 


Mfl'v 


43 


54.5 


2,4 


M4--+IV 


716 


732.5 


.4 


TV 


109 


99.7 


.9 


MfTV 


62 


54.7 


1.0 


Mt -+T V 


42 


51.2 


1.7 


CV 


12 


7.0 


5.6 


MfCV 


8 


3.8 




ttf-+CV 


8 


3.8 




RESIDUAL 


1832 


1832.0 


!2.9 


tota|. 



3 DEGREES OF FREEDOM 





I 

[ 

[ 




TABLE 21 (continued) 
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READERS COMBINED 



[ 

[ 

[ 

I 



3 — 




03SERV. 


EXPECT. 


CHI**2 


SOURCE 


1656 


1564.5 


5.4 


IV 


206 


270.6 


15.4 


rat-iv 


104 


130.9 


5.5 


IH--+IV 


1301 


1392.6 


6.0 


TV 


308 


240.8 


18.7 


PH- TV 


141 


116.5 


5.1 


Wf “+TV 


102 


101.9 


.0 


CV 


15 


17.6 


.4 


MfrCV 


1 1 


8.5 


.7 


M4--+CV 


0 


-.0 




RESIDUAL 


3844 


3844.0 


57.3 


TOTAL 






4 


DEGREES OF FREEDOM 



ALL COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


3481 


3269.0 


13.7 


IV 


398 


543.6 


39.0 


(H-IV 


196 


262.4 


16.8 


PH--+IV 


2707 


2919.3 


15.4 


TV 


642 


485.5 


50.5 


M4-TV 


290 


234.3 


13.2 


PH--+TV 


204 


203.8 


.0 


CV 


23 


33.9 


3.5 


m-cv 


27 


16.4 


6.9 


FH--+CV 


0 


-.1 




RESIDUAL 


7968 


7968.0 


159.1 


TOTAL 



4 DEGREES OF FREEDOM 
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1 UdUflLHBIflMI 



50 



00054 



Table l8 §hows that like ^he noun- phrase grammar, the verb-phrase 
grammar accounts for a very high percent of the verb phrases in the corpus . 
The only cases not included were those in which the modal and verb were 
separated by an adverb or adjective as in n He was slowly going to Tom e n 
All types derivable from the grammar appear in the corpus. 

Unlike the chi-squares for the noun-phrase grammar, the comparative 
chi-squares for the verb-phrase grammar show no significant increase from 
the pre-primers to the readers (Table 20); in fact the comparative chi-square 
for the readers combined is less, though not significantly less, than that 
for the primers combined. In addition the Scott-Foresman pre-primer has 
a relatively high comparative chi-square and the Scott-Foresman reader a 
relatively low comparative chi- square o Table 21 indicates the reason for 
thiSo In the assignment of parameters, the assumption was made that the 
proportion of modal plus verb and modal plus negative plus verb types 
was equivalent for all three kinds of verbs, and this assumption was more 
nearly fulfilled by the observed frequencies of the Scott-Foresman reader 
than by those of the Scott-Foresman pre-primer. In fact, as Table 21 
indicates, most of the relatively high contributions to the chi-square 
values were the result of this assumption In all but one instance (the 
Ginn pre-primer) the observed proportions of modal plus verb types and 
modal plus negative plus verb types are greater for transitive than for 
intransitive verbs,, 

In general the verb-phrase grammar provides a better fit than the 
noun -phrase grammar; with the exception of the pre-primers the total 
chi-squares are lower and the degrees of freedom are greater. However, 
these differences are significant at the .05 level only for the Scott- 
Foresman reader and the readers combined. In both grammars the power of 
the chi-square tests is very large because of the extremely large number of 
observations. The chance of being right in rejecting the null hypothesis 
is very high. 
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Figure 2 corresponds to Figure 1 for noun phrases and, like Figure 1, 
indicates an extremely good fit 5 again, however, the number of degrees of 
freedom is small, and some sort of fit would be expected. It should be 
noted that Figures 1 and 2 represent the fit for the entire corpus which 
had equivalent or larger comparative chi-squares than the individual 
sections. Thus similar figures for individual sections of the corpus would 
show fits which were as good or better than those indicated by Figures 1 
and 2. 



Insert Figure 2 about here 



Grammar for Verbal Modifiers 

The grammar for verbal modifiers is presented in Table 22. Twenty- 
nine syntactic types are derivable from the grammar and twenty free parameters 
are used;, when no collapsing due to low predicted frequencies is needed, the 
model has eight degrees of freedom. No transformations were required. 

Insert Table 22 about here 

Table 23 shows that an extremely high percent of the verbal modifiers 
in each section of the corpus is derivable from the grammar, and the 
observed frequencies (Table 2k) show that all derivable types are found at 
least once in the corpus. 

Insert Tables 23 and 2k about here 

Many types in the individual sections of the corpus had zero or very 
low frequencies, and so the chi-square values for these sections were not 
meaningful. When the theoretical probability of a type should be zero, 
any one of the parameters in the product can be zero and the remaining 
parameters can be given the values which give the best fit to the observed 
frequencies of other types. In the individual sections there were enough 
types of zero frequency to enable the grammar to give a perfect or nearly 
perfect fit. The problem is further indicated by the fact that for 
indiv*,dual sections so much collapsing due to low predicted frequencies 
would have been necessary that all individual sections of the corpus with 
the exception of the Scott-Foresman reader and the readers combined would 
have had negative degrees of freedom. The statistical analysis was completed 
only for the entire corpus and is presented in Tables 25 and 2 6. 

Insert Tables 25 and 2 6 about here 
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Fig- 2- Comparison of observed and predicted frequencies 
for verb phrases (entire corpus). 
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TABLE 22 

The Grammar for Verbal Modifiers 



Rewrite Rules 



VM -> (Adv) + J + NP + \ 



N p )\ 
J+NP 
Adv |J 
C+NP V 



VM -4 Adv + 



Adv 

Adv+Adv 

Adv+J+NP 

C+Adv 

L 



™- l+ (jw|) 

VM -» S + (J+NP) 



VM -> I + (I) 



VM -» J + L 



VM -»NP + 



/I J®* 

[ J-*-NP 
l Adv 
\ C+NP 



Parameters 

Rule-Choice Within-Rule-Choice 

Probabilities Probabilities 



*5 

k & 



B1 ] 

Bl, 



32 1 

B2 2 

B S 

B2 4 

B2 5 



B 5l 

B5 2 

B5 3 

B \ 

B5 5 

B % 

*1 

^2 

b4. 



b 5 £ 

b6. 



B2] 

B2 £ 

B2. 



B2 

B2 r 



if 




GO 

irz 

o 



o 



T* 

0 

1 

o 



LA 

% 



03 0 

fn a 
a) *h 

'S rg 

? I 

<D O 

« o 



CO 

ON 



a) 

l CD 
CO 0 
Ph 



C"— 

ON 



§3 

•H cd 
Cj oj 



CO 

CO 

ON 



' W 0 
0 -H 

II 

*H O 
Ph C> 



ON 

On 



KN ^ 
CM 0 



a 

§ 



o 

u 

u 

< 



i 



ON 

GO, 

ON 



CU 



ON 

ON 



0 

A 

<H 

O 

d 

o 

•H 

•P 

O 

0 

CO 

o 

cd 

0 

<H 

o 

c ! 
0 
u 
5n 
0 
Ph 



£ 



vu 

£ 



CM 

$ 



CO 

ON 



O 

9 



5^ 




9 
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OOOVDVO A rj C^ON A LTNCO VO O 00 ON CVJ On C^CO CVJ H KNHHOnONCO 
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TABLE 25 

Maximum-likelihood Estimates for the Entire Corpus 
for the Grammar for Verbal Modifiers 



Parameter 


Value 


A [ 1 1 


.43*2 


A [2 1 


,3566 


A [3 1 


,1 100 


At 4 1 


.0142 


A[S] 


,04«9 


A [ 5 1 


.0133 


Am 


.0127 


Pit n 


.1243 


Blt2 1 


.R757 


B2tn 


.0047 


B2 1 2 1 


.063 6 


R2t 3 1 


,0306 


B2 1 4 1 


.0207 


B2 1 5 'J 


.0 724 


B3t 1 1 


.0947 


B3t2 1 


.0054 


B3[ 3 1 


.0072 


B3[ 41 


.0326 


83151 


,0072 


B3t 61 


.2515 


B4tl 1 


.0157 


B412 1 


.0352 


B4t3 1 


.9491 


B5t I 1 


.1515 


B 5i 2 1 


,2425 


B6t 1 1 


.2159 


B6[2 1 


.7241 



O 

ERIC 



V 



‘ 57 00061 

TABLE 26 



Observed and Expected Frequencies, 

Chi-square Contributions, and Total Chi-square for 
the Entire Corpus for the Grammar for Verbal Modifiers 



OHSERV. 


EXPECT. 


CHI**2 


SOURCE 


1 5 SO 


1655,4 


.0 


J+NP 


s 


R.4 


.0 


JtMP+MP 


ns 


1 13.4 


.1 


J+NP+J+NP 


ss 


SS ,9 


.1 


J+NP+4DV 


33 


37.0 


.4 


.1+ NP+C+ 1}P 


1411 


141 1.0 


.0 


4 0 v 


1 57 


157.0 


.0 


AWtAQV 


0 


9.0 


.0 


AU'/+40V+A OV 


13 


13.0 


.0 


4DV*-A0V+J+M° 


5 4 


64.0 


.0 
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.0 


4DW-L 
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.0 
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1« 


1R.0 


.0 


L+J+ NP 


5S 


5.6, 0 


.0 
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10 
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.0 
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17R 


17R.0 


.0 
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49 


49.0 


.0 


I+I 


62 


62.0 


.0 


J+L 


69 


75.9 


.6 


MP 
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3.4 




NP+ADV 


S 


5.5 


1.1 


NP+J+UP 
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1 .R 




MP+C+i\}P 
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5.2 


2.9 


EXPECTED EREO 
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.4 




NP+VP 


223 


220.7 


.0 


ADVtJ+NP 


1 


1 .2 




ADUfJ+NP+^P 


1 1 


16.1 


i .6 


ADVf JtMP+J+MP 
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9.R 


.1 


ADW-J+ MP+AOV 


9 


5.2 


2.7 


ADV+J+MP+C+MP 


R 


R.O 


.0 


L+ADV 


2 

V 


1.6 




RESIDUAL 


4.64 S 


4646.0 


9.6 


TOTAL 
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The total chi-square shown in Table 26 is extremely low and indicates 
the best fit of the three grammars presented so far. Figure 3 gives a 
further indication of the quality of this fit; the observed and predicted 
lines are nearly identical showing that the fit is close to perfect. 

Insert Figure 3 about here 

Grammar for Statements Without Verbs 

The grammar for statements without verbs was written for such 
utterances as: "Oh, my.", "Hello, Dick*.", and "What a pretty dress’." 

It is shown in Table 27* 

Insert Table 27 about here 

Twenty-two syntactic types are derivable from the grammar and 17 
parameters are used; four degrees of freedom are available whenever no 
collapsing due to low predicted frequencies is necessary. The same 
parameter, Bl^, was assigned to the choice of the vocative whether it- 
occurred after a verbal modifier, an adjective, or a rejoinder; there was 
no a jrriori reason to think that the proportion of utterances such as 
"Over here, Sally." to "Over here." would be different than the proportion 
of utterances such as "Yes, Sally." to "Yes*" Similarly, the same 
parameters, B3^ and B3gj were assigned to the first and second repetition 
of a sound as to the first and second repetition of a number or the 
uttering of a second and third number in a series. 

The observed frequencies of the syntactic types for each section of 
the corpus are shown in Table 28, and the percent of utterances in each 
section of the corpus which were accounted for by the grammar is shown in 
Table 29 • Again a very high percent of the statements without verbs can be 
generated by the grammar, and all derivable expressions occur in the corpus. 

Insert Tables 28 and 29 about here 

The problem of many zero or low frequency types discussed in 
connection with the verbal-modifier grammar reoccurred for this grammar. 

Only the corpus as a whole could meaningfully be analyzed by maximum- likelihood 
estimates and chi-squares. This analysis is given in Tables 30 and 31* 

Insert Tables 30 and 31 about here 
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Fig. 5. Comparison of observed and predicted frequencie 
for verbal modifiers (entire corpus). 
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TABLE 27 

The Grammar for Statements without Verbs 



Rewrite Rules 

S -*K 
S -* VM+(K) 

S -» A+(K) 

S -> R+(K) 

s -* -+NP 
S -♦ I+R 



Parameters 

Rule-Choice Within-Rule-Choice 

Probabilities Probabilities 



A 6 



*8 

A 9 

^0 



B1 1 

Bl- 



B1 1 

EL. 



B1 1 

BL 



B2] 

B2 £ 

B2- 



B 3] 

B3, 



B5 3 

S3, 

B3- 



bL^ 



B 5l 

,B5 2 
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TABLE 30 

Maximum-likelihood Estimates for the Entire Corpus 
for the Grammar for Statements without Verbs 



Parameter 


Value 


At 1 1 


.2787 


A12 1 


.2438 


At 3 1 


.0166 


At 4) 


.0690 


At 5 1 


.1380 


At 6 1 


.1822 


At 71 


.0166 


At 81 


.0129 


At 91 


.0212 


At 10 1 


.0120 


At 1 1 1 


.0092 


Bltl 1 


.3140 


B 1 12 1 


.4860 


B2U 1 


.0400 


B2I2 1 


.1267 


B2I31 


.8333 


BSt 1 1 


.2546 


B3t2 1 


.0926 


BSt 3 1 


.6528 


B4tl 1 


.6429 


B4 12 1 


.3571 


B5tl 1 


.7826 


B5I2 1 


.2174 
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TABLE 31 

Observed and Expected Frequencies, . Chi-square Contributions, 
and Total Chi-square for the Entire Corpus for the Gr ammar 
for Statements without Verbs 



03SERV. 


EXPECT. 


CHI**2 


SOURCE 


303 


303.0 


.0 


K 


129 


136.2 


.4 


VWfK 
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5.0 


.0 


VWfK+VP! 


136 


128.8 


• 4 


VP! 


19 


19.0 


.0 


NP+VM 


125 


125.0 


.0 


NP 


13 


13.0 


.0 


-+NP 


134 


129.2 


.2 


Z 


49 


50.4 


.0 


Z+Z 


15 


18.3 


.6 


Z+7+Z 


11 


8 0 7 


.6 


A 


7 


9.3 


.5 


A+K 


9 


9.0 


.0 


VWfA 


7 


, 11.8 


1.9 


NBR 


6 


4.6 




NBR+NBR 


5 


1.7 




NBR+NBR+NBR , 


11 


6.3 


3.6 


EXPECTED FREQ 
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6.0 


.0 


NP+C+NP 


27 


36.5 


2.5 


R 


18 


18.0 


.0 


R+R 


10 


10.0 


.0 


I+R 


48 


38.5 


2.3 


R+K 


5 


5.0 


.0 


R+VM 


0 


-.0 




RESIDUAL 



LESS THAN 5.0 



1087 1087.0 




13.0 TOTAL 

3 DEGREES OF FREEDOM 
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The analysis shows that the grammar for statements without verbs 
provides a very good fit, comparable to that of the verbal modifier grammar. 
Figure k provides further indication of this fit. 

Insert Figure 4 about here 

The observed frequencies given in Table 28 show some interesting 
differences in the two reading series. The Scott-Fore small series 
introduces number words and counting in the pre- primer and continues usage 
of this vocabulary in the primer; the Ginn series does not introduce 
number words either as adjectives or for use in counting until the reader. 
The Ginn series makes much more use of sounds, for example "Zoom, zoom." 
and "Mew, mew.", than the Scott-Foresman series. The Ginn series also 
makes more use of noun phrases standing alone. 



Grammar for Interrogatives 

The grammar for interrogatives is presented in Table 32 • The grammar 
includes eight rewrite rules and' a transformation with several parts. The 
grammar generates 22 syntactic types and uses 15 parameters; thus, if no 
collapsing were necessary, the model would have 6 degrees of freedom. Note 
that parameters B2 1 and B2^ were used for a choice of "K" or "VM" whether 
the choice occurred at the end of an interrogative which was an inversion 
of a statement or an interrogative whic i used an interrogative pronoun. 

Insert Table 3 2 about here 

Subscripts are used to insure that the noun in the subjective position 
and the verb agree in number and person; a third subscript is attached 
to each- noun- phrase entry to indicate the required case (subjective 
or objective) of Lne noun. Of course, the person and case subscripts 
effect a change in terminal vocabulary only in case the noun phrase is 
a pronoun. Number and person subscripts have been attached to the noun 
phrases in .the objective positions even though there is no verb with 
which they must agree. This was to avoid confusion in subscript order, 
and to correspond to the notation in the final grammar for statements with 
verbs. In that grammar such subscripts are necessary in the transformations. 
In the tables showing the statistical analysis of this grammar, only the 
case subscripts have been indicated; these are necessary to clarify the 
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Fig. Comparison of observed and predicted frequencies 
for statements without verbs (entire corpus). 
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TABLE 32 

The Grammar for Interrogatives 

Parameters 



Rewrite Rules 

Inversions of Statements: 

Q -» M 1( j+® 1( j , i + ( VP 3 +NP^ £ z j) + ({vM>) i 

f» k i 2 l 

Q - > v OT i,d.i +vp 3 +o+vp 3 + l ™ ; 

Q ^ M i,d +OT i,dA +VP 3 +NP k s i,2 +NP m,n 5 l +VP m,n ' 

Q -» M. ,+NP. , . +VP,+A+K l 

ijd i>d)l 3 

Use of Interrogative Pronoun, Advert, or Adjective: 



Q -» IP. , ,+VP, ,+(0+VP,)+VM 

1,J,± 1,J J 

Q IP i,d,2 +M i,d +NP M,3 +VP 3 + (( 1 



Rule-Choice 

Probabilities 



!( K 
VM 



Within-Rule- Choice 
Probabilities 



Bli B2 1 



B1 2 *2 
B1 3 *3 

B3i 

B3o 



*1 

b4„ 



K 1 

K 2 

B2„ 



o 

ERIC 



Q -» IADV+M, ,+NP, , _+VP,+ 

Ijd Ijdj- 1 - J 



NP. 



k ,/,2 



VM 



*7 



B5 1 

S5 2 

B5, 



Q -> IADJ+NP 6 0 4M, .+NP. . .+VP_ An 

k, 4,2 i,j i> jjl 3 o 

in NP. . . : i = 1,2 - number 

i>d>K 

j = 1,2,3 = person 
k = 1,2 a cose 

in VP. .: i = number 

i,d 

j = 1,2,3 = person 

VP^=> infinitive form (verbs lacking infinitive form may not be used) 
Obligatory Transformations : 

If M. is a form of "to be" or "to do", and VP~ is "be", then: 
ijd 3 

b. My+^i ,j, 1 +VP 3 +OT k,£,2 +1W m,n, 1 +VP m,n , j +NP i , j , l +NP k , £, l +NP m, n , l +VP m, n 

o. M y +NP ij j j i +VP 3 +a+K VP i > j +If? i,j,l +A+K 

e. IADV+M i>;J +NP ij;J>1 +VP 3 +(NP k>/j2 )+(VM) -» I®V+VP ij;J +NP ij; j >1 +(NP k>/>1 )+(VM) 

f. IADJ+NP, . „,+M, ,+NP, , ,+VP, -» IADJ+NP. . ,+VP, ,+NP, , - 

k,i,2 i, j i,j,l 3 k,^,l i,j i,j,l 



d * IP 2 +M i J j +NP i i j ) l +VP 3 + 
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type involved, but the other subscripts should be apparent as they are 

specifically dictated by the grammar * 

The transformations were necessary for sentences involving the verb 

"to be"* They change such sentences as "is that be (being) the book I 

wanted?" and "Why are you be (being) the one in the corner?" to "is that 

the book I wanted?" and "Why are you the one in the comer?" 

Note that transformation (a) makes it possible to derive interrogatives 

involving the verb "to be" with no modal in two ways* Such statements (for 

example, "Is she here?") can be derived from the first rewrite rule either 

by choosing "M. to be "is" and deleting the first set of choices, or by 
ij J 

choosing "M. " to be "is" and "VP " to be "be" in which case the 

i ? J 3 

transformation would produce the desired form* For purposes of analysis 
it was assumed that the direct route would be taken* This seemed a logical 
assumption since the transformation is mainly a safety device in case "to be" 
verbs are accidently chosen for both the modal and the main verb* 

Like the previous two grammars, a large number of zero and low 
frequencies for the individual sections made separate analyses meaningless* 
Table 33 shows the observed frequencies for the individual sections, Table 
3^ shows the percent of each section of the corpus accounted for by the 
grammar, and Tables 35 and 36 show the chi-square analysis for the corpus 
as a whole* 

Insert Tables 33“36 about here 

This grammar accounted for a smaller percent of the interrogatives 
in the corpus than the previous grammars did for their respective parts of 
the corpus* The interrogatives were fewer in number and much more 
irregular than the other types which have been analyzed* The interrogative 
grammar generates one type, (M ,+NP* , -,+K), which is not in the corpus* 

An example of a sentence of this type is "May I, Mother?" which is hot an 
unusual English sentence* This type is derivable from the first rewrite rule 
from which "May I?" and 'May I go, Mother?" can also be derived* Interrogatives 
of these latter types are found in the corpus, and so the type 
(M. ,+NP +K) seemed to be within the framework of the implied grammar* 

1 J J 1 J u 9 1 

To have constructed the grammar so that that type was not derivable . would 
have required a much more complicated grammar and more parameters* 
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TABLE 35 



Maximum- likelihood Estimates for the Entire Corpus 
for the Grammar for Interrogative s 



[ 



L 

L 



y 



Parameter 


Value 


AU 1 


.4662 


AI2 1 


<,0356 


A[3 ] 


.0427 


At 4 1 


.0107 


At 3 1 


.0427 


At 6 J 


.2420 


At 7 1 


.1423 


At 8 1 


.0178 


BUI 1 


.2824 


BH2 1 


.5725 


Blt3] 


.1450 


B2t 1 1 


.0704 


B2t2 1 


.4070 


B2t3 ] 


.5226 


BStl ] 


.3000 


B3t2 1 


.7000 


B4 1 1 1 


.2500 


B4 [2 ] 


.7500 


B5C 1 I 


.1500 


B5t 2 1 


.2500 


B5t 3 ] 


.6000 



i 
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TABLE 36 

Observed and Expected Frequencies, Chi-square 
Contributions, and Total Chi-square for the Entire Corpus 
for the Grammar for Interrogatives 



OBSERV. 


EXPECT, 


3 


9.9 


7 


19.3 


42 


39.2 


0 


1.3 


2 


2.6 


8 


5.3 


16 


7.7 


28 


15.1 


25 


30.5 


3 


3.0 


j 5 


6.9 


7 


7.0 


12 


12.0 


3 


3.0 


9 


9.0 


3 


3.0 


{'• 6 


6.0 


;; 52 


35.5 


i 4 


4.8 


f :; 12 


27.7 


24 


24.0 


f 6 


6.0 


i 1 0 


10.0 


| 5 


5.0 


f; ■ 4 


4.8 


¥■ . 281 


281 .0 



CHI* *2 SOURCE 

4.8 WtNP(l) 

7.9 M+NP(1)+VP 

.2 Wf NP(1) + VP+NP(2) 

Mf NP( 1 )+K 
M+NP ( 1 )+VP+K 
1.4 Wf NP ( 1 )+VP+NP (2)+K 

8.8 WfNP(l)+VM 
11.1 MfNP(l)+VP+VM 

1.0 Wf NP(1 )+VP+NP(2)+VM 
Wf NP ( 1 5+VP+0+VP+NP (2) 

.5 EXPECTED FREQ. LESS THAN 5.0 

.0 WfNP(l>+VP+0+VP+VM 
.0 Wf NP ( 1 I+VP+NP (2)+NP < 1 )+VP 
Wf NP (1 I+VP+A+K 
.0 IP ( 1 I+VP+VM 

IP<l)+VP+0+VP+VM 

.0 EXPECTED FREQ. LESS THAN 5.0 

7.6 lP(2)+WfNP<l)+VP 
IP (2) + Wf NP (1 )+VP+K 

8.9 IP(2)+Wf NP(1)+VP+VM 
.0 IADV+Wf NP(1 >+VP 

.0 IADV+Wf NP(T I + VP+NP (2) 

.0 IADV+Wf NP (1 )+VP+VM 
.0 IADJ+NP(2)+WfNP(l)+VP 

RESIDUAL 

52.3 TOTAL 

2 DEGREFS OF FREEDOM 




O 



I EBJC 
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Regarding the inclusion of unobserved types, it should be noted that 
in a probabilistic gram m ar the criterion of a grammar not generating 
utterances not found in the corpus is not as important as the criterion 
that the probabilities of such utterances be low. If the theoretical grammar 
includes but is not identical to the grammar (the actual grammatical types) 
of the corpus, a good probabilistic fit could be ootained, and there is 
no real need, once a probabilistic viewpoint is adopted for the two grammars 
to be identical. For the interrogative grammar the probability of the 
unobserved type is .005 which is sufficiently low. 

The total chi-square shown in Table 36 indicates that this grammar 
is not as good as the verbal-modifier grammar or the statements-without-verbs 
grammar, but is better than the noun-phrase grammar which had a larger 
comparative chi-square and the same number of degrees of freedom. It is 
not significantly different from the verb-phrase grammar. Figure 5; however, 
shows that in terms of rank order the fit of this grammar is not as good 
as any of the previous grammars. 

Insert Figure 5 about here 



Compound Statements 

A compound statement was defined to be any pair of statements 
derivable from the grammar for statements with verbs (next section) and 
connected by one of the words: "and", "but", or "so”. ("Or" was not 

used as a conjunction in this corpus.) This definition includes such 
statements as "Jane ran home, but Spot did not go with her.", and "I want 
to stay here, and maybe I can." Imperatives such as "Run and help Mother . " 
were considered simple statements with compound predicates; these, are 
derivable from the grammar for statements with verbs and as such are not 
compound statements. Under the stated definition, the statement "Run and 
help Mother and take Spot with you." would have to be considered a compound 
statement, but such combinations did not occur. 

The entire corpus contained only 86 compound statements, a total of 
1.1 $ of the utterances. The syntactic types ranged in frequency from one 
to eight, the average being about four. The types were so dissimilar that 
any grammar would have been little more than a list of observed types; 
in addition, the low frequencies would have made any statistical analysis, 
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Fig, 5 « Comparison of observed and predicted frequencies 
for interrogatives (entire corpus) . 
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including one for the corpus as a whole, invalid. A grammar consisting 
of the statement with verb grammar with the option to connect any two 
generated types by "and", "but", or "so" would have generated far more types 
than were contained in the corpus; while the probabilities of the 
observed types would have been low, the probabilities of the observed types 
would also have been low (due to the low observed frequencies), and the 
result would be a very poor fit. Thus, it was decided that no grammar 
would be written for compound statements. 

Grammar for Statements with Verbs 

As would be expected this grammar is the most complicated; it 
generates 131 utterance types and uses 43 parameters. This would allow 87 
degrees of freedom if no collapsing due to low predicted frequencies were 
necessary. The grammar is presented in Table 37; it contains 24 rewrite 
rules and five obligatory transformations. 

Insert Table 37 about here 

The rewrite rules for the statement grammar have been divided into 
three categories for reading convenience: l) statements with simple subjects 

and predicates and without embedding, 2) statements with compound subjects 
or compound predicates and without embedding, and 3) statements with embedding,, 
The rules for embedding generate three basic utterance types: embedded 

statements without adverbial or subordinate connectives such as ”1 can 
help mother do this." (rules l4-l6), statements with infinitive phrases 
such as "I can help mother to do this/ (rules 17-22), and statements with 
subordinate conjunctions such as f, He did not know that Sally was ready.” 
and "Tom did not know how to dance.” (rules 23 and 24). 

Subscripts have been used as they were in the grammar for interrogatives 
to insure that subjective nouns and verbs agree in number and person, and to 
indicate the case of nouns. For this grammar number and person subscripts 
attached to noun phrases in the objective case were needed for transformation 
4 as well as to avoid confusion of subscript meaning. 

Any choices which involved the same subphrases and occurred in 
relatively the same position in different rewrite rules were assigned the 
same parameters. For example, whenever the choice "(VM)" occurred after 
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TABLE 37 

The Grammar for Statements with Verbs 



Rewrite Rules Parameters 





Rule-Choice 

Probabilities 


Within-Rule-Choice 

Probabilities 


I. Statements with single subject, single 
predicate, and no embedding: 








A 1 


B1 1 B2 1 
Bio B2 

-3 -3 


2 . S-»jic l)+NP, . ^VP, .+(VM)+(K) 
UVM )j 1,3,1 1,3 




B3 1 

B3 2 b*^ B5 X 

B3 3 b4 2 B5 2 

b 3 4 


3. S -» (VM) + 


A 3 


B 2 

*1 B 2 g 

b 6 2 32 ^ 


k. S -* (VM)+VP. +(VM)+(K) 


\ 


b 6 i 

b 6 2 b4 2 B5 2 


5 * S - NP i,j,l +VP i,j +mNP k,/,2 


A 5 




6. S -» UP, , ,+VMfVP, ,+VM 


a 6 




7 . s 

8 ’ S -»i,;|,l +VI, i.3 +(VM)+A+ (P 

9. S -4 VP, -+VM+A+K 
l,c 


*7 
A 12 

a 13 


B 2 

B9 1 B2 2 

B9 2 B2, 

3 



O 

ERIC 



UttUtifl, 

TABLE 37 (continued) 
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II. Statements with compound subjects or 
compound predicates: 

10. s ^VP 1)2 ^.VP 1>2+ (1 



3 ^ HP i >J> l +VI 'i.J +C+VF i.j +( " P k > 1.2 )+ (™> 

12. S (VM) + BP i>3>1 ^BP kj<jl +VP 2> ^(l(P mjnj2 )l(VM) 

13. S 

where j = 2 if NP. . , is deleted 
i»t) > J- 

III. Statements with embedding: 

14. S -* (VM)+(NP. . J+VP, ,+NP. . _+VP_+(VM) 

YLytyC. J 

where j ■ 2 if xhP. . is deleted 

0 9 J- 

15 ’ 3 (®l,J,l)' , ' VP i,j +OT 'k ( ^2 +VI '3 +ra 'm,n,2 + < VM ’ 

where j = 2 if NP. . .. is deleted 



10 



11 



15 



\6 



16. S - («P tfJ>1 M» 1>J + j - »W+ V V*(l 

where j = 2 if NP is deJ.eted 

17. S -* (VM)+NP i}J}1 +VP i}J 40+VP 3 +(NP k ^ }2 )+(VM) 






VM // 17 



*18 



l8 * 



^m,n,2 )\ 

W m,n,2 +VP 3 )/ +(VM) 



19 ‘ S 

20. S -*NP, , -+VP, .+VM-HO+VP- 
i»j»l i»J 3 

21 - S - i " r i ; J ; l-' vp i,i^ +VI> 3 + ®k,i,2 +TO 3 



ERJO- S-NP . ,+VP ( ,+A+0+VE>(VM) 

± 9u9 ± *•90 » 



19 

^o 

*21 

A 22 

A l4 







B2 




B7 i 


B2 




CVJ 

pq 


A 




B7 i 


V 




CVJ 

£q 


E4 2 


b6 1 


B 7i 


*1 


b6 2 


CVJ 

pq 


*2 


b6i 


B8 1 


*1 


b6 2 


b8 2 


E4 2 



b6 1 b8 1 b4 

b6 0 b8„ b4„ 



bSj^ b4 1 
B8 2 B4 2 







Bll. 


^1 


B10 1 


Bll, 


B8 2 


B10 2 


i 

Bll. 


*1 


‘*1 


*1 


*2 


B7 2 


^2 



B12 

B12 £ 

B12. 



B4 ] 

b4„ 



b4 ] 

b4. 



7.8 
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TABLE 37 (continued) 

23. S -> NP. , t+VP, ,+CON+O+VP, 

ljj o 

24 - s ^“ p i,j s l +VP i,J +< " I ’k ( i ( 2 )+(,M)+C0N+,,P m > n > l + 

( VP 

m,n 

VP +W 
m,n 



A 23 






B13. 


^4 


*1 


% 


B13, 


B7 2 


E4 2 


1 

B13, 



in NP. . , = i = 1,2 = number 

1 > ti 

J » 1,2,3 B person 
k = 1,2 = case 

in VP. , = i = 1,2 = number 

i 9 w 

j = 1,2,3 = person 

VP, => infinitive form (verbs lacking infinitive form 
^ may not be used) 

Obligatory transformations: 

1. If "VM" is an adverbial phrase of location or direction (for example: here, 

up, away, then up, off to the store), or if "VM" is one of these descriptions: 
"hippity-hop" , "faster and faster", "swish^, "swish, swish", "hop, hop", 
"splash, splash", "left foot first", "right foot first", and if "VP. ." 

is a form of "to be" or one of these verbs of locomotion: x, “ 

"walk", ".lump", "go", "run", "come", "roll", "buzz", and if NP, . 4 P, then 

(C)+VMfNP, , t+VP. ,+... -> (C)+VMfVP, .+NP. 

2. If "VM" is "so" and "VPj^ " is a form of "to be" or "to do", or is: "can", 

"could", "may", "will", "shall", or "must", then 

VM+NP, ,+VP, -» VMtVP, .+NP. , . 

1,0,1 1,0 i $ J > 

3. If "VP. " is a form of "to be", then 

I > J 

• • • +VP , . +NP. a — + • • • “) • • • +VP . . +NP, a - + • • • 

IjJ 1,0 

4. If "HP "is "what + NP", then 

f > O 9 1 

+NPi , ^ ^ i+V^i , j , i + ^k,^,2 + * "* •+NPj c ,^,,2+NPi, j ,i+VPi, ^+. 

5. If "VP^ is a form of: "to wish", "to think", "to say", "to guess", or 

'•'to know", then 
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the main verb (rules 2, 4, 11, 12, 13 , l4, 15, 17 > l8, 22, and 24), the 
parameters B4^ (probability of accepting) and B4g (provability of deleting) 
were assigned/ whenever the choice "(VM)" occurred before the main verb 
(rules 3, 4, 12, 13 , l4, and . 17 ), the parameters 1 36^ (probability of 
accepting) and B 6 ^ (probability of deleting) were assigned. Parameter 
notation in Table 37 is the same as has been used previously; if two or 
more sets of parameters which sum to one are necessary for the same rule, 
the first vertical column of within-choice parameters contains probabilities 
for the first (left-most) set of choices in the rule, the second column 
contains probabilities for the second set of choices, etc. 

In certain instances inversions of the form specified by the rules 
regularly occurred. For example ’’Here comes Sally.’’ was used instead of 
"Here Sally comes.’’, and "Away ran Dick." was used instead of "Away Dick 
ran." The first transformation specifies exactly when these inversions 
occur; the adverbial phrase is usually one of location or direction 
("here", "up", "off to the store"), the verb is a form of "to be" or a 
verb of locomotion ("run", "come"), and the noun phrase is something other 
than a pronoun. The structure "Here he comes." is always used rather than 
"Here comes he." The second transformation handles a second, type of 
inversion; it changes expressions like "So I can." into "So can I." 

The third transformation changes the noun i.i the objective case 
following a form of "to be" into the subjective case; for example, it would 
change "That is him." into "That is he." The fourth transformation would 
change an utterance like "That is what a pretty dress." into "What a pretty 

dress that is." Note that a noun phrase of the form "what + NP" was 

derivable from the noun- phrase grammar by deriving (T+A+, ..+N)> choosing 
the first "A" to be "what", and applying an obligatory transformation. The 
last transformation changes the verb of an embedded utterance following 
a verb of thought or observation from the infinitive form to a form which 

agrees in number and person with the subject of the embedded clause. For 

example, it would change "I think he want to come." into "I think he wants 
to come." 
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The statistics for this grammar are presented in Tables 38 through 
42. The recurring problem of zero and low frequency types made a chi-square 
analysis for the Ginn pre-primer and the Scott -Fore sman pre-primer invalid, 
but all other sections of the corpus including the pre-primers combined were 
analyzed. The observed frequencies for the Ginn and Scott-Fore sman pre-primers 
are given in Table 38. 

Insert Tables 38-42 about here 

The grammar accounts for 85*5$ of all statements with verbs in the 
corpus as a whole. In each of the three major sections (the pre-primers," 
the primers, and the readers) a greater precent of the Ginn utterances 
than the Scott -Fore sman utterances were derivable from the grammar. The 
Scott-Fore sman books contain a greater variety of utterance types and 
display more irregularity of syntactic pattern than the Ginn books. 

This grammar generated some types which did not appear in. the corpus 
as can be seen from the observed frequencies for the entire corpus in Table 
42. However the probabilities of such types are low (the largest is .003) 
and, as has been explained previously, from a probabilistic viewpoint this 
is completely acceptable. 

The degrees of freedom available for each section of the corpus were 
sufficiently different to require F-tests for purposes of comparison. 

These tests showed no significant difference in fit among sections with 
the exception of the pre-primers combined which had a significantly poorer 
fit. The pre-primers combined had a large number of zero or low frequency 
types and very few degrees of freedom. There is no significant difference 
between the fit of the grammar to the Ginn series and the fit to the 
Scott-Foresman series, so the greater syntactic variety in the Scott-Foresman 
series implied by the comparative percents derivable from the grammar is due 
mainly to low frequency types not accounted for by the grammar. 

The similarity of high frequency syntactic patterns in the two 
series is indicated in Table 43 which shows the ten highest frequency 
types of each section of each series. 



Insert Table 43 about here 
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TABLE 38 



Observed Frequencies of Types of Statements with Verbs 
in Sections not Included in the Analysis 



TYPE 



G! MM s -F 

°RE-PRTMFR PRE-PRIMER 



*l 0 <l >+</»+ M° (2> 


7S 


S9 


M° ( 1 >*M/p+MP<2)+K 


3 


1 


M p ( 1 )+VP+'IPCP) + yi 


20 


24 


C+ M°(l )+V°+M®(2> 


0 


G 


C+ MP(1 >+yP+MP(?.)+K 


n 


0 


C+MPO )+\P+MP(2)+V"1 


n 




VM4- M®(1) + VP+MP(2) 


0 


0 


VM+ MP( 1 ) + VP+ MP <2 )+K 


0 


0 


VW- MP( 1 ) + 'J?+ MP (2 )+ VM 


n 


0 


MP C l )+ VP 


14 


20 


MP(1 )+ VP+VM 


44 


31. 


MP ( 1 )+VP+K 


1 


0 


MP( 1 )+VP+VM4-K 


2 


0 


R+ MP(1 )+ VP 


0 


0 


R+ MP(1 )+ VP+VM 


0 


0 


R+ MP ( 1 )+ VP+K 


0 


0 


R+ MP ( 1 )+ VP+ VMt K 


0 


n 


C+ MP < I ) + VP 


0 


0 


C+MPC1 )+VP+VM 


0 


•w 

O 


C+ MP ( 1 HVP-+K 


0 


0 


C+MP(1 >+ VP+VMtK 


0 


0 


VM+ MP( 1 )+VP 


41 


21 


VW*MP<l)+VPf VW 


14 


6 


VMMP< D + VN-K 


4 


1 


VlMt MP( 1 )+ VP+ VMtK 


0 


n 


V»*MPC2> 


$2 


is 


VP-MP(2>+K 


2? 


0 


'P+MP(2) + VM 


IS 


2 


VMfVP+MP(2) 


0 


n 


VM+> '/P+ MP(2 )+K 


0 


n 


VM* VP+ M° (2 )+ VM 


0 


0 


V® 


s 


!« 



O 

ERIC 

ijflinaffamiaaa 
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TABLE 38 (continued) 



Type 



Ginn S-F 

Pre-Primer Pre-Primer 



v°f v*i 


24 


5P 


VPf V!» h-K 


5Q 


35 


VWf VPf VMf K 


0 


1 


V p f K 


83 


' 27 


V* J+ VPf K 


0 


0 


VWf VP 


0 


1 


VMf VPf VM 


0 


2 


NP(1 )f VPfVWfNP<2) 


0 


0 


s|P < 1 )f VW*VPf VM 


0 


0 


Cf VWf NP(1) + VP 


1 


0 


V p fCfV? 


24 


1 


VPfC+VPf NP(2 ) 


42 


4 


v°+c+ vpf vm 


5 . 


2 


VP+C+VP+K 


5 


0 


V p fCf VPf N^PJf VM 


! 


2 


V p fCf V®f 'IP (2 )f K 


2 


0 


N p ( 1 )f VP+C+ VP 


n 


2 


SIP < 1 >+■ VPfCf VPf NP (2 > 


! 


0 


VP ( 1 )f VPfCf v p + VM 


0 


n 


NP ( 1 >f VPfCf VPf SJP (2 )+ VM 


0 


0 


MP ( 1 )+C+ MP ( 1 )+ VP 


4 


1 


NP(1 >fCf NPtl HVPf NP<2) 


0 


3 


NP(1 >fCf N?(l )+ VP+ NP(2)f VW 


0 


0 


NP(I >fCf NPtl >f VPf VM 


1 


1 


VMfNP(l)fCf NP( 1) + VP 


1 


0 


VMf MP( 1 )+C+ MP( 1 )+ VP+ 'IP (2 ) 


n 


0 


VMf NP( l)fCf NP( 1 )f VPf NP (2 )f VM 


0 


0 


VMf MP< l)fCf NP< l)f VPf VM 


0 


0 


VP+ NP(2)fCf NP(2) 


ll 


2 


VP+ NP(2)fCf NP(2)f VM 


0 


0 


MP(1 )f VPf NP(2)fC+ NP(2) 


3 


5 


MP( 1 )+ VPf NP(2)ffcf NP ( 2 ) f VM 


0 


0 


VWf VP+ NP (2 )f Cf NP (2 ) 


0 


0 


VWf VPf NP (2 )f Cf NP (2 )f VW 


0 


n 


VWf NP ( 1 )f VPf NP (2 )f Cf NP (2 ) 


n 


0 


VWf NP( 1 )f VPf NP (2 )f Cf NP (2 >f VW 


0 


0 
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TABLE 38 (continued) 



Type 

NIP (I )+V°+A 
NJ°(1 j+yD+yvn-A 
MP(1 )+V°+A+K 
Npd )+VP+V*H-A+K 

M°( 1 VP+4+ V*1 

Npd )+ V°+V'1M+VM 

N|»d )+VP+A-*-0+VP 

gp ( i )+ yo+M- o-*. ypf w 
VP+MP(2)+VP 

v°+ <p(2)+vp+ vm 

MP ( I )+ VP+NP(2) + V? 



Ginn , S-F 

Pre-Primer Pre-Primer 



2 
n 
o- 
o 

o o 

n n 

n o 

n n 



.8 

0 

1 

o 



0 0 

1$ 1 

S 2 

0 1 



MPd )+V?+>P(2) + VPf m 1 1 

V!»H-VP+ VP(2)+VP 0 0 

V"H-V?+NP(2)+VP+VM 0 0 



VMf MP( 1 ) + VP+ MP (2 )+ VP 



V!*H- >)P ( 1 ) + VPf NIP (2 )+ VP+ VM 0 0 

VP+ NJP ( 2 ) + VP+ MP (2 ) A 4 

V°+MP(2) + VP+N)P(2)+ VM 0 0 

NPd )+V°+NP(2) + VP+NJP(2) 1 0 

NJ°d )+VP+N°(2) + VP+>iP(2)+VM 0 0 

'P+MP(2) + M D (1) + VP .2 2 

V B +MP(2) + M°(1) + VP+V!1 1 1 

vp+nj°(2)+mp( n+vp+o+vp o n 



ipd )+VP+MP(2)+NJP( 1) + VP * $ 

MP(1 )+VP+NP(2)+NP(l)+VP4-VM 0 1 

MP(1 )+VP+NJP(2) + ’4P( l) + V°+0+VP 0 0 

VP+RP(2) + *P(1) + VP 0 0 

VP+PP(2) + MP(l) + VPfVM 0 0 

VP+RP(2) + NJP( D + VP+O+VP 0 0 

UPd )+VP+RP(2) + MP(D + VP 0 0 

NPd)+VP+RP(2) + NP(D + VP+VM 0 0 

MPd )+VP+RP(2)+MP( D + VP+O+VP 0 0 

MPd )+ vp+o + vp io s 

NPd )+VP+0+VP+NP(2) 9 P 

*IP(1 )+VP+0+VP+NP (2 )+VM 0 1 



i me 
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TABLE 38 (contined) 



i 



\ 

} 



S 

| 

\ 

\ 

I 

! 



. Type 



Ginn S-F 

Fre-Primer Pre-Primer 



NP d )+ VP+0+VP+ VM 12 7 

VM+ MP ( 1 ) + VPf 0+ VP 0 1 

VI*H-MPd) + VPfOVP+MP<2) 0 

vm-»pn)+vp+cH-vp+"jp(2)+\;M o 

WH-MPd) + VPM>f VP+VM 0 

MP C 1 )+VP+MP(2)+0+VP 1 

MPd >+VP+MP(2)+0+VP+ MP<2) 0 

MPd )+VP+\JP(2) + 0+\/Pf MP(2HVP 0 



MPd )+VP+ MP(2)+0+VP+WI 0 

MPd HVP+MP(2)+0+VPfMP<2HVM 0 

M p d )+VP+MP<2)+0+VP4-MP<2)+VP+Vt 0 
M p d )+C+ MP d )+ VP+0+ VP+- VM 0 



MPdHVP+WfO fVP n 

M°d )+VP-*»Q+\/P+ MP(2)+VP 0 

M p d )+ VP+COM+O+VP 0 

!NJP (1 )+V?+C0M+M p < 1) 0 

MPd )+VP+COM+MPd) + VP 0 

MPd HVP+COM+MPdJ + V 01 *- VM 0 

MPd )+VP+MP<2)+C0M+M p < 1) 0 

MPd HVP+MP(2)+C0M+MP( 1) + VP n 



MPd )+VP+MP(2)+C0M+M?< 1 ) + VPf VM 0 

MPd )+VP+ VW-CO Mf NPd ) 0 

MP ( 1 )+ VP+ VMfCO M+ MPd )+ VP 0 



MPd )+VP+VW*-C0 Mf NPd ) + VP+VM 0 

MPd )+VP+MP(2) + Vl»H-C0Mf MPd ) 0 

MP(l)+VP+MP(2) + V!»H-C0iMf MPd )+VP 0 
MPd )+VP+MP(2) + VI1f CO^ MPd )+VP+VM 0 
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TABLE 40 



Maximum-Likelihood Estimate's for Sections of the Corpus 
Included in the Analysis for the Grammar for Statements 

with Verbs 



Parameter 


Pre-Primers 

Combined 


Ginn 

Primer 


S-F 

Primer 


Primers 

Combined 


Ginn 

Reader 


S-F 

Reader 


Readers 

Combined 


• All 
Combined 


A[l] 


.1875 


.2735 


.2579 


.2670 


.2913 


.2778 


.2862 


.2582 


A[2] 


.1884 


.2724 


.2453 


.2611 


.3562 


.3102 


.3389 


.2808 


A[3] 


.1082 


.0291 


.0425 


.0347 


.0216 


.0240 


.0225 


.0454 


A[4] 


.2743 


.1637 


.1541 


.1597 


.1110 


.0802 


. .0995 


.1574 


A[5l 


.0000 


.0000 


.0000 


.0000 


.0029 


.0108 


.0059 


.0027 


A [6] 


.0000 


.0011 


.0016 


.0013 


.0022 


.0024 


.0023 


.0015 


A[7] 


.0009 


.0011 


.0000 


.0007 


.0043 


.0012 


.0032 


.0019 


A[8] 


.0821 


.0191 


.0157 


.0177 


.0151 


.0072 


.0122 


.0294 


A[ 91 


.0028 


.0135 


.0126 


.0131 


.0115 


.0132 


.0122 


.0104 


A[10] 


.0103 


.0213 


.0330 


.0262 


.0231 


.0395 


.0293 


.0241 


A[ll] 


.0196 


.0045 


.0142 


.0085 


.0130 


.0108 


.0122 


.0127 


A[12] 


.0084 


.0392 


.0314 


.0360 


.0260 


.0251 


.0257 


.0251 


A[13] 


.0000 


.0000 


.0000 


.0000 


.0029 


.0012 


.0023 


.0010 


A[14] 


.0000 


.0123 


.0031 


.0085 


.0036 


.0024 


.0032 


.oo4i 


A[15] 


.0280 


.0314 


.0236 


.0281 


.0187 


.0311 


.0234 


.0259 


A[l£] 


.0084 


.0078 


.0142 


.0105 


.0115 


.0251 


.0167 - 


.0129 


A[17] 


.0196 


.0314 


.0283 


.0301 


.0130 


.0108 


.0122 


.0195 


A[18] 


.0504 


.0617 


.0865 


.0720 


.0433 


.0826 


.0581 


.0608 


A[19l 


.0112 


.0135 


.0267 


.0190 


.0079 


.0216 


.0131 


.0145 


A[20] 


.0000 


.0000 


.0000 


.0000 


.0000 


.0060 


.0023 


.0010 


A[21] 


.0000 


.0000 


.0000 


.0000 


.0036 


.0000 


.0023 


.0010 


A[22] 


.0000 


.0034 


.0047 


.0039 


.0022 


.0000 


.0014 


.003.9 


A[23l 


.0000 


.0000 


.0000 


.0000 


.0056 


.0000 


.0023 


.0010 


A[24] 


.0000 


.0000 


.0047 


.0020 


.0115 


.0168 


.0135 


.0068 


Bl[l] 


.0448 


.0123 


.0427 


.0245 


.0668 


.1078 


.08l8 


.0570 


Bl[2] 


.0000 


.0533 


.1159 


.0784 


.1436 


.1595 


..1494 


.1020 


KL[3l 


.9552 


.9344 


.8415 


.8971 


.7896 


.7328 


.7689 


.8410 


B2[l] 


.0845 


.0280 


.0588 


.0405 


.0285 


.0681 


.0429 


.0521 


B2[2] 


.1739 


.2609 


.3620 


.3020 


. r r : 


.4050 


.3883 


.3098 


B2[3l 


.7415 


.7112 


• 5792 


.6575 


.5927 


.5269 


.5688 


.6381 


B3[l] 


.0000 


.0000 


.0577 


.0226 


.0061 


.0347 


: .0159 


.0155 


B3[2] 


.0149 


.0905 


.0321 


.0677 


.0931 


.0425 


.0757 


.0643 


B3[3l 


.4307 


.3621 


.3590 


.3609 


.3704 


.4363 


.3931 


.3892 


B3[4] 


.5545 


.5473 


.5513 


.5489 


.5304 


.4865 


.5153 


.5310 


B4[l] 


.5000 


.5736 


.6336 


.5989 


.6887 


.6503 


.6739 


.6126 


B4[2] 


.5000 


,4264 


.3664 


.4011 


.3113 


.3497 


.3261 


.3874 


B5[l] 


.3891 


.1825 


.1968 


.1882 


.1142 


.0828 


.1037 


.1964 


B5[2] 


.6109 , 


.8175 


.8031 


.8118 


.8858 


.9172 


.8963 


.8036 


B6[i] 


.0114 


.0612 


.1422 


.0974 


.1219 


.2277 


.1654 


.0922 


B6[2] 


.9886 


.9389 


.8578 


.9026 


.8781 


.7723 


.8346 


.9078 


B7[l] 


.4679 


.4078 


.4330 


.4200 


.4069 


.3835 


.3957 


.4211 


B7[2] 


.5321 


.5922 


.5670 


.5800 


.5931 


.6165 


.6043 


.5789 


B0[l] 


.3333 


.5373 


.4706 


.5085 


.6410 


.7077 


.6713 


.5351 


B8[2] 


.6667 


.4627 


.5294 


.4915 


•3590 


.2923 


.3287 


.4649 


B9[l] 


.0000 


.0000 


• 3000 


.1091 


.3333 


.1905 


.2807 


.1818 


B9[2] 


1.0000 


.0000 


.7000 


.8909 


.6667 


.8095 


.7193 


.8182 


B10[l] 


.0000 


.2143 


.1111 


.1739 


.7222 


.4444 


.6296 


.2660 


B10[2] 


1.0000 


.7857 


.8889 


.8261 


.2778 


.5556 


• 3704 


.7340 


Bllfl] 


.0000 


.0714 


.0556 


.0652 


.1111 


.1111 


.1111 


.0638 


Bll[2] 


.1429 


.1786 


.1111 


.1522 


.1667 


• 3333 


.2222 


.1702 


Bll[3] 


.8571 


.7500 


.8333 


.7826 


.7222 


.5556 


.6667 


.7660 


B12[l] 


.4167 


.5000 


.4118 


.4483 


.1818 


• 3333 


.2759 


.3714 


B12[2] 


.0000 


.0000 


.0000 


.0000 


.0000 


.0000 


.0000 


.0000 


B12[3l 


.5833 


.5000 


.5882 


.5517 


.8182 


.6667 


.7241 


.6286 


B13[l] 


.0000 


.0000 


.6667 


.6667 


.4375 


.2857 


.3667 


.3939 


B13[2] 


.0000 


.0000 


.3335 


• 3333 


.0625 


.3571 


.2000 


.2121 


B13[3l 


.0000 


.0000 


.0000 


.0000 


.5000 


.3571 


.4333 


.3939 
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TABLE 4l 






Comparison of Total Chi-Squares 
for the Grammar for Statements with Verbs 




Text 


No. of Statements 
Accounted For 


Total 

Chi-Square 


Degrees 
of Freedom 


Pre-Primers Combined 


1072 


364.7 


2 


Ginn Primer 


892 


249.6 


8 


Scott-Foresman Primer 


636 


254.2 


7 


Primers Combined 


1528 


478.3 


26 


Ginn Reader 


1387 


406.3 


25 


Scott-Foresman Reader 


835 


206.0 


15 


Readers Combined 


2222 


729.2 


44 


All Combined 


>+822 


1644.0 


61 
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TABLE 42 

Observed and Expected Frequencies, Chi-Square Combinations, and Total 
Chi-Squares for each Section of the Corpus Included in the Analysis 
of the Grammar for Statements with Verbs 



FRE-FRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


144 


142.4 


.0 


NP < 1 J + VP+NP (2) 


4 


16.2 


9.2 


NPCl J+VP+NP <2)+K 


44 


33.4 


3.4 


NP (1 J+VP+NP (2 J+VM 


6 


6.7 


.1 


C+NP < 1 J+VP+NP (2) 


0 


.8 




C+NPC1 J+VP+NP (2 J+K 


$ 


1.6 




C+NP(1 J+VP+NP (2 J+VM 


0 


.0 




VWtNPd J+VP+NP (2) 


0 


.0 




VWtNPU J+VP+NP C2J+K 


0 


.0 




VM4-NP<I J+VP+NP (2J+VM 


34 


34.2 


.0 


NP< 1 J+VP 


75 


34.2 


48.6 


NP( I J+ VP+VM 


1 


21.8 


19.8 


NPCl J+VP+K 


2 


21.8 


18.0 


NPCl J+VP+VMtK 


0 


.0 




R+NIPC1 J+VP 


0 


.0 




R+NP(1 J+ VP+VM 


0 


.0 




R+ NPCl J + VP+K 


0 


.0 




R+NIPC1 J+VP+VMtK 


0 


.9 




C+N;P(1)+VP 


3 


.9 




C+NPC1 J + VP+VM 


0 


.6 




C+NP < 1 J+VP+K 


0 


.6 




C+ NP ( l ) + VP+ VMtK 



6 


5.3 


.1 


EXPECTED FREQ. LESS 


THAN 


5.0 


62 


26.6 


47.2 


VWt NPCl J+VP 






20 


26.6 


1.6 


VM+ NPCl J+VP+VM 






5 


16.9 


8.4 


VMt NPCl J+VP+K 






0 


16.9 


16.9 


VWt NPCl J+VP+VMtK 






78 


85.0 


.6 


VP+NP(2) 






23 


9.7 


18.3 


VP+NP (2J+K 






15 


19.9 


1.2 


VP+NPC2J+VM 






0 


1.0 




VMtVP+NP (2) 






0 


.1 




WH-VP+NPC2J+K 






0 


.2 




VWt VP+NPC2 J+VM 






24 


88.8 


47.3 


VP 






82 


88.8 


.5 


VP+VM 






94 


56.5 


24.8 


vp+vm-K 






1 


.7 




VMtVP+VM+K 






90 


56.5 


00 

• 


VP+K 






0 


.7 




VWtVP+K 






1 


1.0 




VWt VP 






2 


1.0 




VMt VP+VM 






0 


.0 




NP( 1 ) + VP+VM+NP(2) 






0 


.0 




NPCl J+ VMt VP+VM 






1 


1.0 




C+ VM+NPC 1 J+VP 






5 


5.7 


.1 


EXPECTED FREQ. LESS 


THAN 


5.0 
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TABLE 42 (continued) 






25 


34.7 


2.7 


VP+C+VP 


46 


30.5 


7.8 


VP+C+ VP+NP (2) 


7 


8.1 


.2 


VP+C+VP+VW 


5 


4.0 




VP+C+VP+K 


3 


7.2 


2.4 


VP+C+VP+NPC2)+VPI 


2 


3.5 




VP+C+ VP+NP C2)+K 


7 


7.4 


.0 


EXPECTED FREQ. LESS THAN 5.0 


2 


.8 




NPC 1>+ VP+C+VP 


1 


.7 




NP(1 )+VP+C+VP+NP (2) 


0 


.8 




VPCH+VP+C+VP+VM 


0 


.7 




NP C 1 )+ VP+C+ VP+NP (2)+VPI 


5 


2.9 




NPC1 )+C+NP(l )+VP 


8 


5.9 


.8 


EXPECTED FREQ. LESS THAN 5.0 


3 


2.5 




NP C 1 >+C+NP C 1 )+VP+NP (2 ) 


0 


2.5 




NP C 1 >+C+NP C 1 )+VP+NP C2)+VM 


3 


5.1 


.9 


EXPECTED FREQ. LESS THAN 5.0 


2 


2.9 




NP(1 )+C+NP(l )+VP+VM 


I 


.0 




VMfNPCl )+C+NP (1)+VP 


0 


.0 




VWf NPC1 )+C+NPCl 5+VP+NP (2) 


0 


.0 




VWf NP C 1 ) +C+NP ( 1 ) +VP+NP (2 )+VM 


0 


.0 




VHf NFC 1 )+C+NP (1 5+VP+VM 


IS 


6.9 


5.3 


VP+NP C25+C+NPC2) 


0 


6.9 


6.9 


VP+NP C2)+C+NPC2)+VP1 


8 


3.5 




NP ( 1 J+VP+NP (25+C+NP (2) 


11 


6.5 


3.2 


EXPECTED FREQ. LESS THAN 5.0 


0 


3.5 




NPC 1 )+ VP+NP (25+C+NP (2)+VM 


0 


.1 




VHf VP+NP (2)+C+NPC2) 


0 


.1 




VFH- VP+NP (2)+C+NP (2)+VM 


0 


.0 




VPH- NP ( H+VP+NP C2)+C+NP (2) 


0 


.0 




VMf NPC D+VP+MP C25+C+NP <2>+VF4 


8 


6.7 


.5 


NPCD+VP+A 


0 


.0 




NPCD+W+VPH-A 


1 


.8 




NPCD+VP+A+K 


0 


.0 




NPCD+VP+VPH-A+K 


0 


1.6 




NPCD+VP+A+VM 


1 


6.0 


4.2 


EXPECTED FREQ. LESS THAN 5.0 



0 

0 

0 

0 



.0 

.0 

.0 

.0 



NPC1 )+VP+Vttf A+VM 
VP+VFH-A+K 
NP C 1 )+VP+A+0+VP 
NP ( J )+VP+A+0+VP+VM 




17 9.9 5.1 VP+NP C2J+VP 

10 9.9 .0 VP+NP C2)+VP+VM 

1 4.9 NPC 1 )+VP+NP C2)+VP 
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TABLE 1+2 (continued) 

2 4.9 



00094 



NPd )+VP+NP (2J+VP+VM 



3 


9.9 


4.8 


EXPECTED FREQ. LESS THAN 5.0 


0 


.1 




VMf VP+NP <2 >+ VP 


0 


.1 




VMfVP+NPC2)+VP+yf1 


0 


.1 




VMf NPd )+VP+NP(2)+VP 


0 


.1 




VMf NPd >+VP+NP(2)+VP+VM 


ft 


3.0 




VP+NP(2)+VP+NP<2) 


0 


3.0 




VP+NP(2)+VP+NP<2)+VM 


8 


6.3 


.4 


EXPECTED FREQ. LESS THAN 5.0 


1 


1.5 




NPd )+VP+NP(2)+VP+NP(2) 


0 


1.5 




NP < 1 )+ VP+NP (2)+ VP+NP (2 )+VM 


4 


12.0 


5.3 


VP+NP (2)+NP(l)+VP 


2 


2.0 




VP+ NP (2 > + NP ( 1 ) + VP+ VW 


3 


5.0 


.8 


EXPECTED FREQ. LESS THAN 5.0 


0 


.0 




VP+NP <2>+NP ( 1 >+VP+0+VP 


14 


6.0 


10.7 


NP(1)+ VP+NP (2)+NP(l)+VP 


1 


1.0 




NP(1)+VP+NP(2)+NP(1)+VP+VM 


0 


.0 




NP(1)+VP+NP(2)+NP(! )+VP+0+VP 


0 


.0 




VP+RP (2)+NP(l)+VP 


0 


.0 




VP+RP (2)+NP (1 )+VP+VM 


0 


.0 




VP+RP (2 ) +NP ( 1 ) +VP+0+ VP 


0 


.0 




NP(1 )+VP+RP(2)+NP(l )+VP 


0 


.0 




NP(!)+VP+RP(2)+NP(1)+VP+VM 


0 


.0 




NP( 1 )+VP+RP <2)+NP (1 )+VP+0+VP 


16 


14.2 


.2 


NP(1 )+VP+0+VP 


17 


12.5 


1.6 


NP ( 1 )+VP+0+ VP+NP (2 ) 


* 

i . 


12.5 


10.6 


NP ( 1 >+ VP+0+ VP+NP (2)+VM 


19 


14.2 


1.6 


NP(1 )+VP+0+VP+VM 


1 


.2 




VMf NP ( 1 )+ VP+O+VP 


0 


.1 




VMf NP ( 1 )+ VP+O+VP+NP (2 ) 


0 


.1 




VMf NP(1 )+VP+0+VP+NP(2)+VM 


0 


.2 




VMf NP(1 )+VP+0+\*+VM 


2 


3.5 




NPd )+VP+NP(2)+u+VP 


4 


5.1 


.2 


EXPECTED FREQ. LESS THAN 5.0 


5 


2.5 




NPd )+VP+NP (2)+0+VP+NP(2) 


0 


.0 




NPd )+VP+NP C2)+0+VP+NP(2)+VP 


5 


3.5 




NP(1 )+ VP+NP (2J+0+VP+VM 


10 


6.0 


2.7 


EXPECTED FREQ. LESS THAN 5.0 



ERIC 



0 

0 

0 

0 

G 



2.5 

.0 

.0 

.0 

.0 



NP ( 1 >+ VP+NP (2)+0+VP+NP (2)+VM 
NPd )+VP+NP(2)+0+ VP+NP (2)+VP+VM 
NPd)+C+NPd)+VP+0+VP+VM 
NP(1 )+VP+VMf 0+VP 
NP(1 >+VP+0+ VP+NP C2J+VP 
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TABLE 42 (continued) 



00095 



0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 

.0 



NP<l>+VP+C0N+0+VP 

NP(1)+VP+C0N+NP<1) 

NP< 1 >+VP+COI»+NP <1 >+VP 
NP(I >+VP+C0N+NP < 1 HVP+VM 
NP< 1 >+VP+NP (2)+C0N+NP < 1 ) 

NP( 1 )+VP+NP (2)+C0N+NP (! )+VP 
WPC 1 )+VP+MP <2>+C0N+NP C 1 >+VP+VM 
MP < 1 )+ VP+VMf CON+NP < 1 ) 

NP<1 )+VP+VWfCON+NP(l)+VP 
NP< 1 >+VP+VWfC0N+NP< 1 )+VP+VM 
NP< 1 )+VP+NP <2>+VFH-C0N+NP < 1 ) 

NP< 1 >+VP+NP <2)+VWfC0M+NP< 1 )+VP 
NP<1 )+VP+NP(2)+VWfC0N+NF (D+VP+VM 



0 2.5 RESIDUAL 

1072 1072.0 364.7 TOTAL 

2 DEGREES OF FREEDOM 



ERjt 
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TABLE 1+2 (continued) 



00096 



GINN HtlMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 




161 


162.2 


.0 


NPC1 )+ VP+NP C2) 




5 


6.4 


.3 


NP( 1 )+VP+NPC2)+K 




62 


59.5 


.1 


HP ( 1 )+ VP+NP C2)+VM 




2 


2.1 




Of UP <1 )+ VP+NP C2) 




0 


.1 




C+NPC1 )+VP+NPC2)+K 




1 


.8 




G+ NPCD+ VP+NP C2)+VW 




13 


9.2 


1.5 


VWf NPC 1 )+VP+NPC2) 




0 


.4 




VWf NPC 1)+ VP+NP C2)+K 




0 


3.4 




VWf NPC 1 )+VP+NPC2)+VM 




3 


6.8 


2.1 


EXPECTED FREQ. LESS THAN 


5.0 


25 


46.4 


9.8 


NPCD+VP 




106 


62.4 


30.5 


NPCD+VP+VM 




1 


10.4 


8.4 


NP ( 1 )+VP+K 




1 


13.9 


12.0 


NP(l)+VP+VHfK 




0 


.0 




R+NP(1)+VP 




0 


.0 




R+NPCD+VP+VM 




0 


.0 




R+NP(1)+VP+K 




0 


.0 




R+NPC1 )+VP+VMfK 




11 


7.7 


1.4 


C+NP(1)+VP 




10 


10.3 


.0 


C+ NPCD+VP+VM 




0 


1.7 




C+NPC1 )+VP+K 




1 


2.3 




C+NP < 1 )+VP+VMfK 




58 


30.7 


24.3 


VPH- NPCD+VP 




26 


41.3 


5.6 


VMf NPCD+VP+VM 




4 


6.8 


1.2 


VWf NPCD+VP+K 




0 


9.2 


9.2 


VWfNPCD+VP+VMfK 




18 


17.4 


.0 


VP+NP C2) 




1 


.7 




VP+NP C2)+K 




5 


6.4 


.3 


VP+NPC2)+VM 




0 


1.1 




VWf VP+NP C2) 




2 


5.8 


2.5 


EXPECTED FREQ. LESS THAN 


5.0 


1 


.0 




VWf VP+NP C2)+X 




1 


.4 




VWf VP+NP C2)+VM 




13 


47.8 


25.3 


VP 




61 


64.3 


.2 


VP+VM 




32 


14.3 


21.7 


VP+VWfK 




1 


.9 




VWf VP+VWfK 




28 


10.7 


28.2 


VP+K 




3 


.7 




VWfVP+K 




1 


3.1 




VWf VP 




7 


5.2 


.6 


EXPECTED FREQ. LESS THAN 


5.0 


7 


4.2 




VWf VP+VM 




0 


.0 




NPC 1 )+VP+VWf NPC2) 
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TABLE k2 (continued) 




ii 

0 

I: 

0 

0 

0 



0 

0 

LI 

I.! 

11 

0 

u 




Li 



LI 




1 


1.0 




NP<1 )+ VWf VP+VM 


8 


5.2 


1.5 


EXPECTED FREQ. LESS THAN 5.0 


1 


II. 0 




C+VWf NPC1 )+VP 


! 


7.2 


5.3 


VP+C+VP 


12 


4.8 




VP+C+VP+NP<2) 


15 


5.8 


8.4 


EXPECTED FREQ. LESS THAN 5.0 


4 


2.6 




VP+C+VP+VM 


0 


.3 




VP+C+VP+K 


0 


1.8 




VP+C+VP+NP(2)+VM 


0 


.2 




VP+C+ VP+NP (2)+K 


8 


3.0 




NP ( 1 )+ VP+C+VP 


12 


7.8 


2.1 


EXPECTED FREQ. LESS THAN 5.0 


0 


2.1 




NP< D+VP+C+ VP+NP <2> 


5 


4.1 




VP ( 1 )+ VP+C+VP+VM 


3 


6.2 


1.6 


EXPECTED FREQ. LESS THAN 5.0 


1 


2.8 




NP( 1 )+ VP+C+ VP+NP <2)+VM 


1 


4.5 




NP ( 1 )+C+NP ( 1 )+VP 


2 


7.3 


3.8 


EXPECTED FREQ. LESS THAN 5.0 


6 


3.1 




NP ( 1 )+C+NP ( 1 )+ VP+NP (2) 


1 


4.2 




NP ( 1 )+C+NP ( 1 )+VP+NP <2)+VPI 


7 


7.3 


.0 


EXPECTED FREQ. LESS THAN 5.0 


8 


6.1 


1.4 


NP(1 )+C+NP(l )+VP+VM 


2 


.3 




VWf NP ( 1 )+C+NP ( 1 )+VP 


0 


.2 




VWf NP ( 1 >+C+NP ( 1 )+VP+NP (2) 


0 


.3 




VWfNPCl)+C+NP(l)+VP+NP(2)+VM 


0 


.4 




VWf NP < 1 )+C+NP ( 1 )+ VP+VM 


1 


.7 




VP+NP <2>+C+NP<2) 


0 


1.0 




VP+NP(2)+C+NP(2)+VM 


3 


.8 




NP M )+ VP+NP (2 )+C+NP (2 ) 


0 


1.2 




NP< 1 )+ VP+NP (2)+C+NP(2)+VM 


0 


.0 




VWf VP+NP (2)+C+NP (2) 


0 


.1 




VWf VP+NP <2>+C+NP(2)+VM 


6 


5.0 


.2 


EXPECTED FREQ. LESS THAN 5.0 


0 


.1 




VPH- NP ( 1 )+ VP+ NP (2 )+C+ NP (2 ) 


0 


.1 




VWf NP< 1 )+VP+NP (2)+C+NP (2)+WI 


22 


24.8 


.3 


NP(1 )+VP+A 


0 


.0 




NP(1 )+VP+VMf A 


2 


1.0 




NP(1)+VP+A+K 


G 


.0 




NP ( 1 )+VP+VMf A+K 
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TABLE 42 (continued) 



00098 



O 

ERLC 



11 


9.1 


.4 


NP( 1 )+VP+A+VM 


0 


.0 




NPCn+VP+VWfA+VPl 


0 


.0 




VP+VMfA+K 


4 


4.7 




NP (1 )+VP+A+0+VP 


6 


5.8 


.0 


EXPECTED FREQ. LESS THAN 5.0 


7 


6.3 


.1 


NP ( 1 )+VP+A+0+ VP+VM 


11 


5.2 


6.5 


VP+NP(2)+VP 


4 


7.0 


1.3 


VP+NP (2)+VP+VM 


8 


6.0 


.6 


NP(1 )+VP+NP(2)+VP 


5 


8.1 


1.2 


NP ( 1 5+VP+NP <2>+VP+VM 


0 


.3 




VMf VP+NP C2J+VP 


0 


.5 




VMf VP+NP (2) + VP+VM 


0 


.4 




VMf NP < 1 )+ VP+NP (2)+VP 


0 


.5 




VMf NP < 1 J+VP+NP <2)+VP+VM 


2 


1.4 




VP+NP <2)+VP+NP (2) 


0 


1.9 




VP+NP (2J+VP+NP (2)+VM 


4 


1.6 




NP(1 )+VP+NP(2)+VP+NP(2) 


€ 


6.6 


.0 


EXPECTED FREQ. LESS THAN 5.0 


1 


2.2 




NP(1 )+VP+NP(2)+VP+NP <2)+VM 


9 


7.6 


.2 


VP+NP «2)+NP(l)+VP 


0 


1.8 




VP+NP <2>+NP(l)+VP+VM 


0 


.7 




VP+NP<2)+NP<1 )+VP+0+VP 


12 


8.9 


1.1 


NPC1 )+VP+NP<2)+NP(l )+VP 


1 


2.1 




NP( 1 )+VP+NP(2)+NP (1 )+VP+VM 


2 


6.8 


3.4 


EXPECTED FREQ. LESS THAN 5.0 


0 


.8 




NP< 1 )+VP+NP (2)+NP (1 )+VP+0+VP 


0 


2.1 




VP+RPC2)+NP(1 )+VP 


4 


.5 




VP+RP C2 ) +NP ( 1 ) + VP+VM 


0 


.2 




VP+RP (2)+NP ( 1 >+VP+0+VP 


0 


2.4 




NP(t )+VP+RPC2)+NP(l )+VP 


4 


6.0 


.7 


EXPECTED FREQ. LESS THAN 5.0 


0 


.6 




NP ( I J+VP+RP <2)+NP < 1 1+VP+ VM 


2 


.2 




NP ( 1 )+VP+RP (2J+NP (1 )+ VP+O+VP 


6 


13.0 


3.8 


NPU )+VP+0+VP 


17 


9.0 


7.2 


NP ( 1 )+VP+0+VP+NP (2) 


4 


12. 1 


5.4 


NP ( 1 )+ VP+O+VP+NP (2)+VM 


27 


17.5 


5.1 


NP<l)+VP+0+ VP+VM 


0 


.8 




VPH-NPU)+VP+0+VP 


1 


.6 




VM+ NP < 1 ) +VP+0+VP+ NP <2 ) 


0 


.8 




VMf NP(1 )+VP+0+VP+NP<2)+VM 


0 


1.1 




VMf NPCD+VP+0+ VP+VM 


5 


2.6 




NP (i)+ VP+NP <2>+0+ VP 


8 


6.7 


.2 


EXPECTED FREQ. LESS THAN 5.0 
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TABLE k2 (continued) 



00099 



0 



6 2.6 

0 .0 

I 3.4 



NP< I >+VP+NP <2>+0+VP+NP (2) 
NPd >+VP+N^ <2>+0+VP+NP <2>+VP 
NP ( 1 )+VP+NP <2>+0+VP+VM 



li 

0 



7 6.0 



.2 EXPECTED FRED. LESS THAN 5.0 



0 

0 

0 

0 

3 



3.4 

.0 

.0 

.0 

3.0 



NP ( 1 7+VP+NP <2)+0+VP+NP <2>+ VM 
NP( 1 )+VP+NP <2>+0+VP+NP<2)+VP+VM 
NP( 1 >+C+NPd J+VP+O+VP+VM 
NPd )+VP+WMfO+VP 
NP ( 1 )+VP+0+VP+NP (2 )+ VP 



0 



3 



6.4 1.8 EXPECTED FREQ. LESS THAN 5.0 



0 



u 

u 

li 




NPd>+VP+C0N+0+VP 
NP ( I )+VP+CON+NP ( I ) 

NP ( 1 >+VP+CON+NP ( I >+VP 
NPd)+VP+CON+NPd)+VP+VM 
NP ( I >+VP+NP (2)+C0N+NP ( I ) 
NPd)+VP+NP<2>+C0N+NPd>+VP 
NP ( 1 )+VP+MP (25+CON+NP < 1 >+VP+ Wl 
NP ( I >+VP+ VW+CON+NP ( t ) 

NP < I )+VP+ VMf CON+NP d >+VP 
NP ( 1 )+VP+ VMf CON+NP ( I )+ VP+VM 
NP d >+VP+NP (2)+VMf CON+NP d ) 

NP ( I 5+VP+NP (2)+VMf CON+NP ( I )+ VP 
NPC I >+VP+3P (2)+VMfC0N+NP ( I )+VP+VM 



0 -.0 



RESIDUAL 



0 



892 892.0 249.6 TOTAL 



8 DEGREES OF FREEDOM 

li 



Li 



0 

o 

ERIC 
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TABI£ US (continued) 




SCOra-PORBaMAN PRIMER 



ERjt 



OBSERV. 


EXPECT ► 


CH1**2 


SOURCE 






82 


19. 9 


• ,1 ' 


NPUV+VP+NPCS) 






6 


8.i 


.6 


NPCI )+VP+NP(2)+K 






50 


50.0 


«,0 


NP<I>+VP+NP(2>+VM 






4 


4.1 




C+NPC 1 )+VP+NP(2) 






0 


• 4 




CfNP(l)+VP+NPC2)+K 






3 


2.5 




C+NPC 1 )+VP+NPC2)+VH 






r 7 


7.0 


.0 


EXPECTED PRES. LESS 


THAN 


5.0 


18 


11.0 


4.4 


Wf NP C 1 >+VP+NP C2> 






0 


1,1 




VMf NP < 1 )+VP+NP C2)+K 






1 


8.9 


5.0 


VHMPCI>+VP+NPC2>+VH 




14 


23.3 


5.1 


NPC1M-VP 






72 


43.8 


18.2 


RPCD+VP+VM 






0 


6.2 


6.2 


NPCD+VP+K 






0 


10.7 


10.7 


NPCI >+VP+VWfR 






8 


2.6 




R+NPCD+VP 






1 


4.6 




R+NP C 1 )+VP+VN 






9 


8.3 


.1 


EXPECTED FREQ. LESS 


THAN 


5.0 


0 


.6 




R+NPCD+VP+K 






0 


1.1 




R+NPC1 >+VP+VNM( 






3 


1.5 




C+NP C 1 )+VP 






2 


2.5 




C+NPC 1 )+VP+VW 






3 


5.8 


.1 


EXPECTED FREQ. LESS 


THAN 


5.0 


0 


• 4 




C+NPC1J+VP+K 






0 


.6 




C+NPC l>+VP+VH+K 






33 


16.5 


16.6 


VWfNPCl )+VP 






21 


28.9 


2.0 


W-NP Cl )+VP+VM 






2 


4.0 




VH+MPCD+VP+K 






2 


3.0 


1.8 


EXPECTED FREQ. LESS 


THAN 


5.0 


0 


7.0 


7.0 


VWfNPCI>+VP+VHfK 






8 


13.4 


2.2 


VP+NPC2) 






4 


1.4 




VP+NPC2)+K 






6 


8.4 


.7 


VP+NPC2>+VM 






0 


2.2 




VMf VP+NPC2) 






1 


•2 




VNf VP+NP C2)+K 






8 


1.4 




WH*VP+NPC2>+V« 






13 


5.2 


11.7 


EXPECTED FREQ. LESS 


Than 


5.0 


12 


24.7 


6.6 


VP 






59 


42.8 


1.4 


VP+VM 






41 


10.3 


88.8 


VPPVWtK 
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TABLE 42 (continued) 



1 


1.7 




VMfVP+VWfK 


6 


€.1 


.0 


VP4-K 


0 


1.0 




VFH-VP+K 


0 


4.1 




VMfVP 


1 


6.8 


5.0 


EXPECTED FREQ. LESS THAN 5.0 


3 


7.1 


2.4 


VMfVP+VM 


0 


.0 




KP<1>+VP+VW+NP<2> 


1 


1.0 




NP( 1 )+VMf VP+VW 


0 


.0 




C+VW+NP<1>+VP 


0 


3.3 




VP+C+VP 


2 


2.5 




VP+C+VP+NPC2) 


3 


6.8 


2.1 


EXPECTED FREQ. LESS THAN 5.0 


6 


2.1 




VP+C+ VP+VM 


0 


.3 




VP+C+VP+K 


0 


1.6 




VP+C+VP+NP (2>+VP 


2 


.3 




VP+C+VP+NP (2) +K 


4 


1.7 




NP ( 1 ) + VP+C+ VP 


12 


5.9 


6.4 


EXPECTED FREQ. LESS THAN 5.0 


3 


1.3 




NP( ! )+ VP+C+VP+NP (2) 


1 


2.9 




VPC1>+VP+C+VP+VM 


0 


2.2 




RPCD+VP+C+ VP+NP <2>+VPI 


4 


6.3 


.9 


EXPECTED FREQ. LESS THAN 5.0 


0 


3.7 




NPa>+C+NP(l>+VP 


2 


2.9 




NP C 1 5+C+NP C 1 >+VP+NP (2) 


2 


6.6 


3.2 


EXPECTED FREQ. LESS THAN 5.0 


5 


4.9 




NP<i>+C+NP(l )+VP+NP(2)+VM 


4 


6.5 


.9 


NP(1)+C+NPU>+VP+VM 


6 


• 6 




VPH-NPCD+C+NPCD+VP 


11 


5.6 


5.3 


EXPECTED FREQ. LESS THAN 5.0 


0 


.5 




VWfNPCl>+C+NP(l)+VP+NP(2> 


1 


.8 




VH+ NP ( 1 >+C+ NP ( 1 )+ VP+NP (2)+VM 


3 


1.1 




VM+ NP ( 1 ) +C+NP C 1 ) +VP+ VM 


1 


1.5 




VP+NP C2)+C+NF’<2> 


0 


2.6 




VP+NP C2)+C+NPC2)+VW 


5 


6.5 


.3 


EXPECTED FREQ. LESS THAN 5.0 


5 


1.3 




NP( 1 5+VP+NP (2)+C+NP (2) 


0 


2.3 




NP ( 1 )+ VP+NP <2)+C+NP (2)+VM 


0 


.2 




VMf VP+NP (2)+C+NP (2) 


1 


.4 




VTO- VP+NP (25+C+NP (27+VM 


2 


.2 




VHf NP C 1 ) + VP+NP (2 ) +C+NP (2 ) 
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TABlil 42 (continued) 



00102 



.4 



VMf NP (1 5+VP+NP (25+C+NP (25+VM 



9 8.1 .1 NP(15+VP+A 

5 3.5 NP ( 1 5+VP+VMf A 



13 



8.4 



2.5 EXPECTED FREQ. LESS THAN 5.0 



0 .8 

0 .4 

5 5.1 .0 

1 2.2 

0 .0 

2 .7 

0 1.3 



NP(1 5+VP+A+K 
NPd 5+VP+VMf A+K 
NP( 1 5+VP+A+VM 
NP(15+VP+VMfA+VM 
VP+VMfA+K 
NP(15+VP+A+0+VP 
NP < 1 5+ VP+A+O+VP+VM 



3 5.3 1.0 EXPECTED FREQ. LESS THAN 5.0 



2 2.5 VP+NP (25+VP 

7 4.3 VP+NP (25+VP+VM 

9 8.8 .7 EXPECTED FREQ. LESS THAN 5.0 



0 2.2 NPd 5+ VP+NP (25+VP 

3 3.8 NPC1 )+ VP+NP (25+VP+VM 

3 6.1 1.5 EXPECTED FREQ. LESS THAN 5.0 



0 

0 

0 

3 

2 

5 



.4 
.7 
.4 
.6 
1 *7 
3.0 



VMf VP+NP (25+VP 
VMf VP+NP (25+VP+VM 
VMf NP<1 5+VP+NP (25+VP 
VMf NPd 5+VP+NP (25+VP+VM 
VP+NP (25+VP+NP (25 
VP+NP (25+VP+NP (25+VM 



10 6.9 1.4 EXPECTED FREQ. LESS THAN 5.0 



1 1.6 

1 2.7 

8 7.1 

0 .9 



NP(1 5+VP+NP (25+VP+NP (25 
NP(!5+VP+NP(25+VP+NP(2)+VM 
VP+NP(25+NP(l 5+VP 
VP+NP (25+NP (1 5+VP+VM 



2 5.2 1.9 EXPECTED FREQ. LESS THAN 5.0 



0 

7 

0 

1 

0 

1 

0 

0 

1 

0 



.5 

6,3 

•8 

.4 

.9 

.1 

.1 

.8 

.1 

.1 



VP+ NP (2 5+NP ( 1 5 + VP+O+VP 
NP ( 1 5+ VP+NP (25 +NP ( 1 5 + VP 
NPd 5+VP+NP(25+NP(l 5+VP+VM 
NP ( 1 5+ VP+NP (25+NP (1 5+VP+O+VP 
VP+RP (25 +WPd 5+VP 
VP+RP (2 5 +NP (1 5 + VP+ VM 
VP+RP (2 5 + NP ( 1 5 + VP+0+ VP 
NP ( 1 5+ VP+RP (25+NP (1 5+VP 
NP ( 1 5+ VP+RP (25+NP ( 1 5+VP+VM 
NPd 5+VP+RP (25+NP ( 1 5+VP+O+VP 



TABLE k-2 (continued) 



3 


9.8 


4.7 


17 


7.5 


12.1 


9 


12.9 


1.2 


23 


16.9 


2.2 


1 


1.6 




4 


5.4 


.3 


1 


1.2 




0 


2.1 




I 


2.8 




2 


6.2 


2.8 


7 


3.7 




4 


2.6 




11 


6.2 


3.7 


0 


.0 




3 


6.3 


1.8 


3 


4.4 




0 


.0 




0 


.0 




0 


.0 




3 


3.0 




6 


7.4 


.3 


0 


.0 




0 


.0 




2 


.4 




1 


.2 




0 


.0 




0 


.3 




0 


.2 




0 


.0 




0 


.7 




0 


.4 




6 


.0 




0 


.5 




0 


.3 




3 


3.0 





636 636.0 254.2 



O 

ERIC 



00103 



NPCD+VP+O+VP 

NPCl)+VP+0+VP+NP(2> 

NP< 1 )+VP+0+ VP+NP (2) + VM 
NP< 1 )+VP+0+VP+VM 
VPH-NP<l)+VP+0+VP 

EXPECTED FREQ. LESS THAN 5.0 

WH-NP(1 >+VP+0+VP+NP<2) 

VW+ NP ( I HVP+O+VP+NP <2>+VM 
V!#-NP<l)+VP+0+VP+VM 

EXPECTED FREQ. LESS THAN 5.0 

NP ( 1 )+VP+NP (2)+0+VP 

NP( I )+VP+NP C2)+0+VP+NP<2) 

EXPECTED FREQ. LESS THAN 5.0 

NPU>+ VP+NP C25+0+ VP+NP <2)+VP 
NP< 1 )+ VP+NP C2)+0+VP+VM 
NP(l)+VP+NPC2)+0+ VP+NP <2)+VM 
NP( I )+ VP+NP <2)+0+VP+NP<2)+VP+VN 
NPd)+C+NP<l)+VP+0+VP+VM 
NP(1 >+VP+VMfO+VP 
NP(l)+VP+0+VP+NP<2)+VP 

EXPECTED FREQ. LESS THAN 5.0 



NP(l)+VP+C0N+0+VP 

NP(1)+VP+C0N+NP<1) 

NP ( 1 ) + VP+CON+NP ( 1 ) + VP 
NP ( I )+ VP+CON+NP< 1 )+VP+VM 
NP(1)+VP+NP(2)+C0N+NP<1) 

NPCD+ VP+NP (2)+CGN+NP<l)+VP 
NP ( I )+ VP+NP (2) +CON+NP ( I ) + VP+ VPI 
«P < i )+VP+VW+CON+NP < I ) 

NP ( I )+ VP+VWf CON+NP ( 1 )+VP 
NP( I )+VP+VM*CON+NP(l )+VP+VM 
NP ( 1 )+ VP+NP <2)+VMf CON+NP < I ) 

NP ( I )+ VP+NP <2)+VWf CON+NP ( 1 )+VP 
NP ( I )+ VP+NP (2)+VM+C0N+NP ( I J+VP+VM 



RESIDUAL 

TOTAL 

DEGREES OF FREEDOM 
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TABLE k-2 (continued) 



00104 



PRIMERS COMBINED 



09SERV. 


EXPECT. 


CHI**2 


SOURCE 


243 


240.6 


.0 


NP<1)+VP+NP<2> 


1 1 


14.8 


1.0 


NPd )+VP+NP(2)+K 


1 12 


110.5 


.0 


NP(I)+VP+NP<2>+VW 


6 


6.6 


.1 


C+NPd )+VP+NP (2) 


0 


.4 




C+NP<1 )+VP+NP (2)+K 


4 


3.0 




C+NPd )+VP+NP (2)+VM 


3! 


21.0 


4.7 


VWfNPd)+VP+NP<2> 


0 


1.3 




VWfrNPd )+VP+NP(2)+K 


1 


9.7 


7.8 


VPH- NP <1 )+VP+NP (2)+VM 


39 


71.3 


14.6 


NP < 1 )+VP 


1 78 


106.5 


48.0 


NPd)+VP+VPI 


1 


16.5 


14.6 


NP<1 )+VP+K 


1 


24.7 


22.7 


NP <1 )+VP+VMfK 


8 


2.9 




R+NPd ) + VP 


12 


7.7 


2.5 


EXPECTED FREQ. LESS ' 


t 


4.4 




R+NPd )+VP+ VP! 


0 


.7 




R+NPd )+VP+K 


! 


5.1 


3.3 


EXPECTED FREQ. LESS ’ 


0 


1.0 




R+NPd )+VP+VPR-K 


14 


8.8 


3.1 


C+NPd >+VP 


12 


13.1 


.1 


C+NPd )+VP+VPI 


0 


2.0 




C+NPd )+VP+K 


1 


3.0 




C+NPd )+VP+VMfK 


1 


6.1 


4.3 


EXPECTED FREQ. LESS ' 


91 


46.9 


41.5 


VW+NPd ) + VP 


47 


70.0 


7.6 


VW+NPd )+VP+VM 


6 


10.9 


2.2 


VM+NPd )+VP+K 


0 


16.2 


16.2 


VWfNPd )+VP+VM+K 


26 


31.5 


.9 


VP+NP<2) 


5 


1.9 




VP+NP(2)+K 


11 


14.4 


.8 


VP+NP(2>+VM 


0 


3.4 




VPH-VP+NP<2) 


5 


5.3 


.0 


EXPECTED FREQ. LESS * 


2 


.2 




VM+VP+NP C2)+K 


9 


1.6 




VWf VP+NP (2)+W 


25 


71.7 


30.4 


VP 


96 


107.1 


1.1 


VP+VP1 


* 7 l 

sLC 34 


24.8 


93.5 


VP+VPH-K 


2.7 




VW+ VP+VM+K 


16.6 


18.2 


VP+K 


“ 3 


1.8 




VM+VP+K 



THAN 5.0 
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A i.. i 's.i. . 

TABLE 42 (continued) 



00105 



o 

ERIC 



1 6 


6.2 


15.3 


EXPECTED FREQ. LESS THAN 5.0 


1 


7.7 


5.9 


VMfVP 


10 


11.6 


.2 


VMf VP+VM 


0 


.0 




NP ( 1 5+VP+VMf NP (2) 


2 


2.0 




NP(1)+ VMf VP+VM 


l 


1.0 




C+VMf N!P(15+VP 


1 


10.3 


8.4 


VP+C+VP 


14 


7.5 


5.7 


VP+C+ VP+NP (2) 


10 


4.7 




VP+C+ VP+VM 


13 


7.7 


3.6 


EXPECTED FREQ. LESS THAN 5.0 


0 


.6 




VP+C+VP+K 


0 


3.4 




VP+C+ VP+NP (25+VM 


2 


.5 




VP+C+ VP+NP (2)+K 


12 


4.7 




NP( 15+ VP+C+VP 


14 


9.2 


2.5 


EXPECTED FREQ. LESS THAN 5.0 


3 


3.4 




NP ( 1 ) + VP+C+ VP+NP (2 ) 


4 


6.9 


1.3 


VP (D + VP+C+ VP+VM 


1 


5.0 


3.2 


NP ( 1 5+ VP+C+ VP+NP (25+VM 


1 


8.4 


6.5 


NP ( 1 5 +C+ NP ( 1 ) +VP 


8 


6.1 


.6 


NP < ! 5 +C+ NP ( 1 5+VP+NP (2) 


6 


9.1 


1.0 


NP(1)+C+NP(1 5+VP+NP (25+VM 


13 


12.5 


.0 


NP( 1 5+C+NP ( 1 5+VP+VM 


8 


.9 




VMf NP( 1 5+C+NP(l 5+VP 


0 


.7 




VMf NP ( 1 5 +C+ NP ( 1 5+ VP+NP (2 5 


1 


1.0 




VMf NPd 5+C+NP (1 5+VP+NP (25+VM 


12 


5.9 


6.3 


EXPECTED FREQ. LESS THAN 5.0 


3 


1.4 




VMf NP ( 1 5+C+NP ( 1 5+VP+VM 


O 

c* 


2.3 




VP+NP (25+C+NP (25 


0 


3.5 




VP+NP (25+C+ NP (25+VM 


5 


7.1 


.6 


EXPECTED FREQ. LESS THAN 5.0 


8 


2.4 




NP(1 5+VP+NP (25+C+NP (25 


0 


3.6 




NP ( 1 5+VP+NP (25+C+NP (25+VM 


8 


6.0 


.7 


EXPECTED FREQ. LESS THAN 5.0 


0 


.2 




VMf VP+NP (25+C+NP (2 5 


1 


.4 




VMf VP+NP (25+C+NP (25+VM 


2 


.3 




VMf NP ( 1 5 + VP+NP (2 5 +C+ NP (25 


0 


.4 




VM+NP(1 5+VP+NP (25+C+NP (25+VM 


31 


32.2 


.0 


NPd 5+VP+A 


5 


3.9 




NP( 1 5+VP+VMf A 


8 


5.2 


1.5 


EXPECTED FREQ, LESS THAN 5.0 
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I TABLE 42 (continued) 



00106 



i 

S 



l 

t 







2 


2.0 




NPd)+VP+A+K 


0 


.2 




NPd)+VP+VFW-A+K 


16 


14.8 


.1 


NPd)+VP+A+VM 


1 


1.8 




NPd)+VP+VM+A+VM 


0 


.0 




VP+VMfA+K 


6 


5.2 


.1 


NPd)+VP+A+0+VP 


7 


7.8 


.1 


NPd)+VP+A+0+VP+VM 


13 


7.7 


3.7 


VP+NP(2>+VP 


11 


11.4 


.0 


VP+NP <2>+VP+VM 


8 


7.9 


.0 


NPd)+VP+NP(2)+VP 


8 


1J.8 


1.2 


NPd >+ VP+NP <2>+VP+VW 


0 


.8 




VWf VP+NP (2)+VP 


0 


1.2 




VfW\ VP+NP (2)+VP+VM 


3 


6.1 


1.6 


EXPECTED FREQ. LESS THAN 9.0 


0 


.9 




VPIfNPd>+VP+NP<2>+VP 


3 


1.3 




VFH-NPd>+VP+NP<2)+VP+VPl 


4 


3.2 




VP+NP (2)+VP+NP<2) 


7 


5.3 


.6 


EXPECTED FREQ. LESS THAN 5.0 


5 


4.7 




VP+NP (2>+ VP+NP (2)+VM 


5 


3.3 




NP ( 1 )+VP+NP (2)+VP+NP (2) 


10 


8.0 


.5 


EXPECTED FREQ. LESS THAN 5.0 


2 


4.9 




NP ( 1 )+VP+NP (27+VP+NP (2)+VM 


17 


14.6 


.4 


VP+NP (2)+NPd)+VP 


0 


2.8 




VP+NP(2)+NPd )+VP+VW 


2 


7.7 


4.2 


EXPECTED FREQ. LESS THAN 5.0 


0 


1 -2 




VP+NP (2)+NP(l?+VP+0+ VP 


19 


15.1 


1.0 


NP C 1 )+ VP+NP (2)+NP ( 1 )+VP 


1 


2.9 




NPd )+VP+NP (2)+NPd )+VP+VM 


1 


1.3 




NPd )+VP+NP(2)+NPd )+VP+0+VP 


2 


5.4 


2.2 


EXPECTED FREQ. LESS THAN 5.0 


0 


3.1 




VP+PP(2)+NP(1)+VP 


5 


.6 




VP+RP(2)+NPd )+VP+VFI 


0 


.3 




VP+RP (2J+NP ( I J+VP+O+VP 


0 


3.2 




NP ( 1 )+VP+RP (2)+NP ( 1 )+ VP 


5 


7.1 


.6 


EXPECTED FREQ. LESS THAN 5.0 


1 


.6 




NP C 1 5+VP+RP (2)+NP ( 1 5+VP+VM 


2 


.3 




NPd )+ VP+RP (2)+NP d )+VP+0+ VP 


9 


23.1 


8.6 


NP(l)+VP+0+VP 


34 


16.7 


17.8 


NP ( 1 )+VP+0+ VP+NP (2) 


13 


25.0 


5.7 


NP C 1 )+VP+0+ VP+NP <2)+VH 



1 

2 

6 

0 

1 

1 

2 

0 

0 

4 

3 

0 

0 

0 

6 

0 

0 

2 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

3 
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(continued) 




34.5 7.0 NPd )+VP+0+VP+VM 

2.5 VMf NP<1 )+VP+0+VP 

1.8 VMfNP < 1 5+VP+O+VP+NP (2) 

5.2 .1 EXPECTED FREQ. LESS THAN 5.0 



2.7 VMfNPd )+VP+0+VP+NP(2)+VM 

3.7 vmfNP(l)+VP+0+VP+VM 

6.4 4.6 EXPECTED FREQ. LESS THAN 5.0 

6.4 4.9 NP <1 )+VP+NP(2)+0+VP 

5.2 4.4 NP(l)+VP+NP(2)+0+VP+NP<2) 

.0 NP < 1 )+VP'fNP <2)4-0+VP+NP <2)+VP 

9.6 3.3 NPd)+VP+NP<2)+0+VP+VM 

7.8 2.9 NP(!)+VP+NP(2)+0+VP+NP(2)+Vn 

.0 NPd)+VP+NP<2)+(W-VP+NP<2>+VP+VM 

.0 NPd>+C+NPd>+VP+0+VP+VM 

.0 NP <1 )+VP+VMfO+VP 

6.0 .0 NP(l)+VP+0+VP+NP(2)+VP 



.0 NP < 1 )+ VP+C ON+O+VP 

.0 NPd)+VP+CON+NPd> 

.5 NPd )+VP+CON+NPO )+VP 

.2 NPd)+VP+CON+NPd)+VP+VM 

.0 NPd 5+VP+NP (2)+C0N+NPd ) 

.3 NP(1)+VP+NP(2)+C0N+NP(1)+VP 

.2 NP C 1 )+VP+NP (2)+C0N+NP d 5+VP+VM 

.0 NPd)+VP+VMfCON+NPd> 

.7 NPd)+VP+VfH-CON+NPd)+VP 

.3 NPd)+VP+V»fCON+NPd)+VP+VM 

.0 NP(l)+VP+NP(2)+VWfC0N+NP(l) 

.5 NPd)+VP+NP(2)+VMfC0N+NPd)+VP 

.3 NP(1 )+VP+NP(2)+VF , rfC0M+NP(l )+VP+VM 



3.0 RESIDUAL 

1528.0 478.3 TOTAL 



26 



DEGREES OF FREEDOM 



00108 



TABLE 42 (continued) 



GINN HEADER 
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;erv. 


EXPECT. 


CHI**2 


SOURCE 


185 


189.1 


.1 


NP( 1 )+VP+NP(2) 


6 


9.1 


1.1 


NP< I )+VP*f NP(2)+K 


128 


120.8 


.4 


NP(!) + VP+NP(2)+VM 


19 


16.0 


.6 


C+NP(1)+VP+NP(2> 


0 


.8 




C+NP(1 5+VP+NP <2)+K 


8 


10.2 


.5 


C+NP(S)+VP+NP(2)+VM 


36 


34.4 


.1 


VWf NP < 1 )+VP+ NP (2) 


0 


1.7 




VWfNP<l)+VP+NP(2)+K 


22 


22.0 


.0 


VWfNPCl )+VP+NP(2)+VM 


27 


72.3 


28.3 


NPCD+VP 


231 


159.8 


31.7 


NPCn+VP+VM 


1 


9.3 


7.4 


NP (1 )+VP+K 


3 


20.6 


15.0 


NP(I)-fVP+VWfK 


3 


.8 




R+NPCO+VP 


0 


1.8 




R+NPCD+VP+VM 


3 


5.1 


.9 


EXPECTED FREQ. LESS 


0 


.1 




R+NP(1)+VP+K 


0 


.2 




R-f NP { i ) +VP+ VWf K 


11 


12.7 


.2 


Of NP(D+VP 


35 


28.1 


1.7 


C+NP( 1 5+VP+VM 


0 


1.6 




C+NP(1)+VP+K 


0 


3.6 




C+NP(l)+VP+VWfK 


0 


5.6 


5.6 


EXPECTED FREQ. LESS 


76 


50.5 


12.9 


VWfNP(l)+VP 


100 


111.6 


1.2 


VWf NP<1)+VP+VW 


5 


6.5 


.3 


VWf NP(1 )+VP+K 


2 


14.4 


10.7 


VWf NP(l)+VP+VWfK 


15 


15.6 


.0 


VP+NP<2> 


2 


.8 




VP+NP(2)+K 


0 


10.0 


10.0 


VP+NP(2>+VW 


11 


2.2 




VWf VP+NPC2) 


1 


.1 




VWf VP+NP(2)+K 


1 


1.4 




VWf VP+NP(2)+VW 


31 


37.3 


1.1 


VP 


51 


82.5 


12.0 


VP+VW 


38 


10.6 


70.4 


VPfVWfK 


0 


1.5 




VWf VP+VWf K 


15 


5.9 


14.1 


EXPECTED FREQ. LESS 


25 


4.8 




VP+K 


0 


.7 




VMfVP+K 


25 


5.5 


69.6 


EXPECTED FREQ. LESS 



0 0*001*— — o O — O — — VJ 00*0* CO 0\ 00*01 00 -4 — 0 O 0*^100 to a\ -4 0*J*U>0 



CTS-MeW 






TABLE 42 (continued) 



10 . 



00109 



ERIC 



14 



5.2 

11.4 

4.0 

5.0 

7.0 

6.0 

7.4 
5.1 

4.7 
.4 

5.1 

3.2 

.2 

3.0 

6.4 

2.0 

6.5 

4.5 

6.5 

5.2 

3.6 

7.9 

11.5 
•• 

• r 

.5 

1.1 

5.9 

1.6 

1.8 

3.9 

7.3 

3.2 

7.0 

.2 

.5 

.4 

1.0 

5.3 



5.2 VMfVP 

.5 VWfVP+VM 

NP < 1 3+VP+VWf NP (2) 

NP( 1 )+ VWfVP+VM 

.0 EXPECTED FREQ. LESS THAN 5.0 

.0 C+WH-NPC13+VP 

3.9 VP+C+VP 
S .7 VP+C+VP+NPC23 
VP+C+VP+VM 
VP+C+VP+K 

4.8 EXPECTED FREQ. LESS THAN 5.0 
VP+C+VP+HP C23+VW 

vp+c+ vp+np cam 

NP (I 3+ VP+C+VP 

.4 EXPECTED FREQ. LESS THAN 5.0 

NPCI3+VP+C+ VP+NP (2) 

1.9 VP U 3+ VP+C+VP+VM 

NPC 1 3+VP+C+VP+NP C23+VM 

.0 EXPECTED FREQ. LESS THAN 5.0 

2.0 NPC13+C+HPC13+VP 

NP < I 3+C+NP Cl 3+VP+NPC23 

3.0 NPC 1 3+C+NP C13+VP+NPC23+VM 

1.8 NPC 1 3+C+NPC! 3+VP+VM 
VM4- NP Cl 3+C+NP Cl 3+VP 

VWf NP C 1 ) +C+ NP C 1 3 + VP+NP C2) 
VM+NP Cl 3+C+NP C13+VP+NPC23+VM 

2.9 EXPECTED FREQ. LESS THAN 5.0 

VpH-NP Cl 3+C+NP Cl 3+VP+VM 
VP+NP C23+C+NP (23 
VP+NP (2 3+C+NP (23+VM 

5.4 EXPECTED FREQ. LESS THAN 5.0 

NPC 1>+ VP+NP (23+C+NPC23 

1.3 NP C 1 3+VP+NP C23+C+ NP (23+VM 
VWf VP+NP (23+C+NPC23 

VWf VP+NP C23+C+NP (23+VM 
VPH-NP Cl 3+VP+NP (2 3+C+NP C2) 

VWf NP C 1 3+VP+NP (2 3+C+NP (23+VM 



14.0 EXPECTED FREQ. LESS THAN 5.0 
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00110 



TABLE k2 (continued) 



I: 

| 



i. O 

fERJjC 



2 


7.1 


3.7 


NP(l) + VP+VMfA 


1 


.7 




NP( 1 )+VP+A+K 


0 


.3 




NP(1 )+VP+VMf A+K 


10 


9.1 


.1 


NP(1)+VP+A+VM 


10 


4.5 




NP ( 1 ) + VP+ VMf A+ VM 


11 


5.6 


5.3 


EXPECTED FREQ. LESS THAN 5.0 


4 


4.0 




VP+VMfA+K 


0 


1.6 




NP < 1 )+ VP+A+O+VP 


4 


5.6 


.4 


EXPECTED FREQ. LESS THAN 5.0 


5 


3.4 




NP < 1 )+ VP+A+O+VP+VM 


6 


2.6 




VP+NP(2)+VP 


11 


6.0 


4.2 


EXPECTED FREQ. LESS THAN 5.0 


8 


5.6 


1.0 


VP+NP(2)+VP+VM 


4 


4.6 




NP(1)+VP+NP(2)+VP 


7 


10.1 


.9 


NP < 1 > + VP+NP <2 ) + VP+ VM 


0 


.4 




VMf VP+NP C2)+VP 


0 


.8 




VMf VP+NP <2)+VP+VM 


4 


5.7 


.5 


EXPECTED FREQ. LESS THAN 5.0 


0 


.6 




VMf NP ( I )+VP+NP(2)+VP 


1 


1 .4 




VMf NP C 1 ) + VP+ NP <2 ) + VP+ VM 


2 


1.8 




VP+NP(2)+VP+NP(2) 


6 


4.0 




VP+NP <2 > + VP+NP <2 ) + VM 


9 


7.8 


.2 


EXPECTED FREQ. LESS THAN 5.0 


3 


3.2 




NP < 1 ) + VP+NP (2 ) + VP+ NP <2 ) 


5 


7.1 


.6 


NP(1 )+ VP+NP (2)+ VP+NP (2)+VM 


ri 

w 


1.3 




VP+NP C2)+NP(1 )+VP 


0 


.3 




VP+ NP (2 ) +NP < 1 ) + VP+ VM 


0 


.2 




VP+NP(2)+NP(1 )+VP+0+VP 


4 


2.3 




NP(1)+VP+NP(2)+NP<1 )+VP 


7 


7.3 


.0 


EXPECTED FREQ. LESS THAN 5.0 


1 


.5 




NP( 1 )+VP+NP(2)+NP(l )+VP+VM 


0 


.4 




NP < 1 > +VP+ NP (2 )+ NP ( 1 )+ VP+0+ VP 


4 


3.4 




VP+RP(2)+NP(1 )+VP 


2 


.8 




VP+RP (2 ) +NP ( 1 ) + VP+ VM 


7 


5.0 


.8 


EXPECTED FREQ. LESS THAN 5.0 


0 


.5 




VP+RP (2)+NP ( 1 )+VP+0+VP 


5 


6.0 


.2 


NPa>+VP+RP(2)+NP<l)+VP 


0 


1.4 




NP < 1 > + VP+RP <2 > +NP < 1 > + VP+ VM 


2 


.9 




NP C 1 )+ VP+RP (2)+NP <1 >+VP+0+VP 



* 




TABLE 4-2 (continued) 



O 

ERIC 



2 


9.7 


.6.1 


NP < 1 5+VP+O+VP 


20 


6.7 


26.6 


NP < 1 5+ VP+O+VP+NP (2) 


10 


14.8 


1.5 


NP < 1 5+ VP+O+VP+NP (25+VM 


23 


21.3 


.1 


NP< 1 5+VP+O+VP+VN 


1 


1.4 




VWf NP ( 1 5+VP+O+VP 


1 


.9 




VWf NP ( 1 )+ VP+O+VP+NP (2) 


4 


5.1 


.2 


EXPECTED FREQ. LESS THAN 5.0 


0 


2.0 




VWf NP ( 1 )+ VP+O+VP+NP (2)+Vi* 


3 


3.0 




VM+ NP ( 1 )+ VP+0+VP+ VM 


3 


5.0 


.8 


EXPECTED FREQ. LESS THAN 5.0 


5 


2.8 




NP < 1 )+VP+NP <2)+0+VP 


1 


.6 




NP ( 1 5+VP+NP <2)+0+VP+NP (2) 


0 


•0 




NP(15+VP+NP(25+0+VP+NP(25+VP 


4 


6.2 


.8 


NP < 1 ) + VP+ NP <2 ) +0+ VP+ VM 


1 


1.4 




NP( 15+ VP+NP (25+0+ VP+NP (25+VM 


0 


• 0 




NP < 1 5+VP+NP (25+0+ VP+NP <2)+VP+VM 


0 


• 0 




NP < i 5+C+NP < 1 )+VP+0+VP+VW 


5 


5.0 


.0 


NP<1 5+VP+VM+Q+VP 


3 


3.0 




NPC1 )+VP+0+ VP+NP (2)+VP 


10 


7.8 


.6 


EXPECTED FREQ. LESS THAN 5.0 


5 


5.0 


.0 


NP < 1 5+VP+CON+O+VP 


0 


1.5 




NP< 1 )+VP+CON+NP <1 ) 


1 


1.3 




NP < 1 5+VP+CQN+NP < 1 )+VP 


1 


.2 




NP(1)+VP+C0N+NP(1)+VP+VM 


0 


1.0 




NP < 1 5+VP+NP (2)+C0N+NP ( 1 ) 


0 


•9 




NP < 1 )+ VP+NP (2)+C0N+NP(l )+VP 


0 


.1 




NP < 1 5+VP+NP C2)+C0N+NP < 1 )+VP+VM 


3 


3.3 




NP< 1 )+VP+VMf CON+NP ( 1 ) 


5 


8.2 


1.3 


EXPECTED FREQe LESS THAN 5.0 


6 


2.9 




NP ( 1 5+VP+VMf CON+SP ( 1 )+VP 


0 


.4 




NP < 1 5+ VP+ VPH-CON+NP ( 1 5+VP+ VM 


5 


2.2 




NP< 1 5+VP+NP (25+VM+CON+NP < 1 5 


11 


5.3 


5.5 


EXPECTED FREQ. LESS THAN 5.0 


0 


2.0 




NP < 1 5+VP+NP (25 +VM+C ON+NP < 1 5+VP 


0 


•3 




NP < 1 5+VP+NP (25+VM+CON+NP ( 1 5+VP+VM 


0 


2.2 




RESIDUAL 


387 


1387.0 


406.3 


TOTAL 






25 


DEGREES OF FREEDOM 
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00112 




TABIE 42 (continued) 



SCOTT-FOBESMAN HEADER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 






81 


89.6 


.8 


NPO )+VP+NP(2) 






10 


11.6 


.2 


NPO )+VP+NP(2)+K 






79 


68.9 


1.5 


NPO )+VP+NP(2)+VM 






16 


13.2 


.6 


C+NPO )+VP+NP(2) 






1 


1.7 




C+NPO )+VP+NP(2)+K 






8 


10.1 


.4 


C+NPO )+VP+NP(2)+VM 






21 


19.5 


.1 


VMf NPO )+VP+NP(2) 






1 


2.5 




VMf NPO )+VP+NP(2)+K 






15 


15.0 


.0 


VMfNPO)+VP+NP<2>+VI«] 




8 


40.4 


26.0 


NPO)+VP 






117 


75.1 


23 . o 


NPO)+VP+VM 






0 


3.6 




NPO )+VP+K 






2 


7.9 


4.4 


EXPECTED FREQ. LESS 


THAN 


5.0 


1 


6.8 


4.9 


NPO )+VP+VMf K 






9 


2.9 




R+NPO )+VP 






0 


5.4 


5.4 


R+NPO )+VP+VM 






0 


.3 




R+NPO )+VP+K 






0 


.5 




R+NPO )+VP+VWfK 






1 


3.5 




C+NPO )+VP 






10 


7.2 


i.l 


EXPECTED FREQ. LESS 


THAN 


5.0 


9 


6.6 


.9 


C+NPO )+VP+VM 






0 


.3 




C+NPO )+VP+K 






1 


.6 




C+NPO )+VP+VMfK 






39 


36.2 


.2 


VMf NPO )+VP 






73 


67.4 


.5 


VMf NPO 5+VP+VM 






1 


3.3 




VMf NP(1)+VP+K 






0 


6.1 


6.1 


VMf NP O )+VP+VMf K 






5 


8.1 


1.2 


VP+NP(2) 






4 


1.1 




VP+NP(2)+K 






6 


5.2 


.1 


EXPECTED FREQ. LESS 


THAN 


5.0 


1 


6.3 


4.4 


VP+NP(2)+VM 






7 


2.4 




VMf VP+NP<2) 






0 


.3 




VMf VP+NP(2)+K 






3 


1.8 




VMf VP+NP(2)+VM 






15 


16.6 


.2 


VP 






24 


30.9 


1.5 


VP+VM 






16 


2.8 




VP+VMfK 






26 


7.3 


47.4 


EXPECTED FREQ. LESS 


THAN 


5.0 


0 


.8 




VMf VP+VMfK 






7 


1.5 




VP+K 






1 


.4 




VMf VP+K 







i 






vh n A. n i t i 



TABLE k2 (continued) 



3.09 



00113 



2 


4.9 




10 


7.7 


.7 


2 


9.1 


5.5 


9 


9.0 


.0 


2 


2.0 




1 


1.0 




0 


1.9 




4 


1.2 




7 


6.2 


.1 


1 


1.5 




0 






0 


.9 




1 


.2 




2 


2.4 




4 


5.2 


.3 


3 


1.5 




6 


4.4 




9 


5.9 


1.6 


0 


2.7 




2 


5.5 


2.2 


5 


3.4 




5 


€.2 


.2 


3 


6.4 


1.8 


8 


1 J.2 


.5 


7 


1.6 




3 


1.0 




0 


1.9 




5 


3.0 




15 


7.5 


7.5 


0 


.7 




0 


1.3 




6 


1.7 




2 


3.2 




8 


7.0 


.2 


0 


.2 





0 .4 

O * «5 

ERIC 0 - 9 

10 9.0 .1 



VtttVP 

EXPECTED FREQ. LESS THAN 5.0 
VWfVP+VM 

NP(13+VP+VWfNP<23 
NP< 13+ VWfVP+VM 
C+VWf NP(1 )+VP 

VP+C+VP 
VP+C+ VP+NP (23 

EXPECTED FREQ? LESS THAN 5.0 

VP+C+VP+VM 
VP+C+VP+K 
VP+C+ VP+NP (23+VM 
VP+C+ VP+NP (23+K 
NP(13+VP+C+VP 

EXPECTED FREQ. LESS THAN 5.0 

NP(!)+VP+C+ VP+NP (2) 

VP < 1 3+VP+C+VP+VM 

EXPECTED FREQ. LESS THAN 5.0 

NP < 1 )+ VP+C+ VP+NP (23+VM 

NP< 1 3+C+NP <1 3+VP 

NP ( 1 3+C+NP < 1 3+VP+NP <2 ) 

EXPECTED FREQ. LESS THAN 5.0 

NP < 1 )+C+NP < 1 3+VP+NP (23+VM 
NP ( 1 3+C+NP < 1 3+VP+VM 
VWf NP<1 3+C+NP(13 + VP 
VWfNPd 3+C+NP <1 3+VP+NP <23 
VWfNPd 3+C+NP <1 3+VP+NP (23+VM 
VWf NP < 1 3 +C+ NP ( 1 3 + VP+ VM 

EXPECTED FREQ. LESS THAN 5.0 

VP+NP (23+C+NP <23 
VP+NP <2 3+C+NP (23+VM 
NP( 1 3+VP+NP (23+C+NP (23 
NP< 1 3+VP+NP(23+C+NP(23+VM 

EXPECTED FREQ. LESS THAN 5.0 

VWf VP+NP (23+C+NP (23 
VWf VP+ NP (23 +C+ NP <2 3 + VM 
VWf NP < 1 3 + VP+ NP <2 3 +C+ NP <2 3 
VWf NP < 1 3 + VP+ NP <2 3 +C+ NP (2 3 + VM 



NP< 1 3+VP+A 



110 



i 




00114 







TABLE 42 


(continued) 








I 


3 


2.1 




NP< 1 )+VP+VM+A 




i 

j 


2 


1.2 




NP<1)+VP+A+K 




1 

j 


6 


5.3 


.1 


EXPECTED FREQ. LESS THAN 5.0 




j 


0 


•3 




NP ( 1 )+ VP+VM+A+K 






5 


6.9 


.5 


NP(1 )+VP+A+VM 




1 


1 


1.6 




NP ( 1 )+ VP+ VM** A+ VM 






1 


1.0 




VP+VMfA+K 






1 


.7 




NP(l)+VP+A+0+VP 






1 


1.3 




NP(l)+VP+A+0+VP+VM 






6 


2.1 




VP+NP<2)+VP 


f 




10 


6.9 


1.3 


EXPECTED FREQ. LESS THAN 5.0 


i 

f, 




3 


3.8 




VP+NP <2>+VP+VM 


fj 




2 


5.0 




NPCH+ VP+NP <2>+VP 


3f 

j 1 




5 


8.8 


1.6 


EXPECTED FREQ. LESS THAN 5.0 


£ 

i; 




12 


9.2 


.8 


NP ( 1 >+ VP+NP (2J+VP+VM 






0 


.6 




VM+VP+NP(2).+VP 


? 




0 


1.1 




VMf VP+NP (2J+VP+VM 


S: 

(• 

f 




0 


1.5 




VM**NP(1 )+VP+NP(2)+VP 




3 


2.7 




VM+ NP ( 1 )+ VP+NP £2)+VP+VM 


i 

* 




3 


5.9 


1.4 


EXPECTED FREQ. LESS THAN 5.0 


1 

i 




4 


2.1 




VP+ NP (2 ) + VP+NP (2 ) 






4 


4.0 




VP+ NP (2 )+ VP+NP (2 ) +VM 






8 


6.1 


.6 


EXPECTED FREQ. LESS THAN 5.0 


1 

i 

i 




6 


5.2 


.1 


NP ( 1 )+VP+NP <2)+VP+NP (2) 


i 




7 


9.7 


.7 


NPC1 )+VP+NP(2)+VP+NP(2)+VM 


j 




0 


.8 




VP+NP (2)+NP(l)+VP 






0 


.5 




VP+NP (2)+NP(l )+VP+VM 






0 


.2 




VP+NP (2)+NP (l l+VP+O+VP 






2 


2.0 




NP( 1 >+VP+NP(2)+NP( 1 )+VP 


I 




2 


1.2 




NP ( 1 )+ VP+NP (2) +NP ( 1 ) + VP+ VM 


■ f- 
|V 

4' 




1 


.4 




NP(1)+ VP+NP (2)+NP(l )+VP+0+VP 


f 

1 




5 


5.0 


.0 


EXPECTED FREQ. LESS THAN 5.0 


f 




1 


.6 




VP+RP(2)+NP (1 )+VP 


h 




1 


.4 




VP+RP (2>+NP ( 1 )+VP+VM 


'■ 




0 


.1 




VP+RP (2)+NP ( 1 )+VP+0+VP 






2 


1.6 




NPC1 )+VP+RP(2)+NP(l )+VP 






0 


.9 




NP ( 1 )+ VP+RP <2)+NP ( 1 )+VP4 VM 






0 


.3 




NPCD+VP+RP (2)+NP(l )+VP+0+VP 


|ER1C 


6 

21 


11.5 

7.1 


2.6 

26.9 


NP(1 )+VPvO+VP 
NP(l)+VP+0+ VP+NP (2) 






7 


13.3 


3.0 


NP ( 1 5+VP+0+ VP+NP (2)+VM 



TABLE h-2 (continued) 



18 


21.4 


.5 


NPCD+VP+O+VP+VM 


5 


3.4 




VW+ NP < 1 ) + VP+0+ VP 


10 


7.4 


.9 


EXPECTED FREQ, LESS THAN 5.0 


4 


2.1 




VN+NPU J+VP+O+VP+NP (2) 


0 


3.9 




VW- NP < ! J+VP+O+VP+NP <2 >+VM 


4 


6.0 


.7 


EXPECTED FREQ. LESS THAN 5.0 


7 


6.3 


.1 


VM+ NP < 1 ) + VP+0+VP+ VM 


6 


4.2 




NP < 1 )+VP+NP(2)+0+VP 


5 


2.1 




NP C 1 J+VP+NP (2J+0+VP+NP (2) 


11 


6.3 


3.5 


EXPECTED FREQ. LESS THAN 5.0 


0 


.0 




NP(1)+VP+NP <2)+0+VP+NP<2)+VP 


6 


7.8 


.4 


NP C 1 J+VP+NP <2)+0+VP+VM 


1 


3.9 




NP < 1 J+VP+NP <2)+0+VP+NP(2)+VM 


0 


.0 




NP ( 1 J+VP+NP <2)+0+VP+NP <2)+VP+VW 


5 


5.0 


.0 


NP ( 1 )+C+NP <1 J+VP+O+VP+VM 


0 


.0 




NP ( 1 J+VP+VM+O+VP 


0 


.0 




NP C 1 )+VP+0+VP+NP <2)4- VP 


0 


.0 




NP ( 1 )+VP+C0N+0+VP 


0 


1.1 




NPC 1 )+VP+CON>NP <1 ) 


2 


.9 




NP C 1 )+VP+CON+NP < 1 )+VP 


3 


5.8 


1.4 


EXPECTED FREQ. LESS THAN 5.0 


4 


1.1 




HP C i KVP-fCON+SP (1 >-8*VP4*VM 


0 


.7 




NP ( 1 )+ VP+NP <2 ) +CON+NP < 1 ) 


0 


.5 




NP(l)+VP+NP(2)+C0N+NP(i)fVP 


0 


.7 




NP < 1 )+VP+NP <2)+C0N+NP < 1 ) + VP+VM 


5 


2.0 




NP ( 1 )+VP+VMfCON+NP < 1 ) 


2 


1.6 




NP ( 1 )+VP+VM+ CON+NP < 1 )+VP 


11 


6.6 


3.0 


EXPECTED FREQ. LESS THAN 5.0 


1 


2.0 




NP ( 1 )+VP+VMf CON+NP < 1 )+VP+VM 


0 


1.2 




NP ( 1 )+VP+NP <2)+VMf CON+NP < 1 ) 


0 


1.0 




NP ( 1 )+VP+NP(2)+VMf CON+NP < 1 )+ VP 


0 


1.2 




NP( 1 )+VP+NP <2)+VMfC0N+NP(l )+VP+VM 


1 


5.5 


3.7 


EXPECTED FREQ. LESS THAN 5.0 


0 


-.0 




RESIDUAL 


835 


835.0 


206.0 


TOTAL 



15 DEGREES OF FREEDOM 



112 



00116 



TABLE 42 ( continued) 



READERS COMBINED 



OBSERV. EXPECT. CHI**2 SOURCE 



266 


278.2 


.5 


16 


21.0 


1.2 


207 


189.9 


1.5 


35 


29.6 


1.0 


1 


2.2 




16 


20.2 


.9 


57 


54.0 


.2 


1 


4.1 





NP(l)fVPfNP<2) 
NP(l)fVPfNP(2)fK 
NP(l)fVPfKP<2)fVM 
CfNP(l)fVPfNP(2> 
CfNP(l)fVPfNP(2)fK 
Cf NP<l)fVPfNP(2)fVM 
VMfNP(l)fVPfNP<2) 
VMf NP(1 )f VPf NP (2)fK 



2 



6.3 2.9 EXPECTED FREQ. LESS THAN 5.0 



37 


36.9 


.0 


35 


113.4 


54.2 


348 


234.4 


55.1 


1 


13.1 


11.2 


4 


27.1 


19.7 


12 


3.5 




0 


7.2 


7^2 


0 


.4 




0 


.8 




12 


16.7 


1.3 


44 


34.4 


2.7 


0 


1.9 





VMf NP <1 >f VPf NP <2)f VM 

NP(l)fVP 

NP( 1 )f VPf VM 

NPOJfVPfK 

NP(l)fVPfVWfK 

Rf NP(1 )f VP 

RfNP(l)fVPfVM 

RfNP(l)fVPfK 

Rf NP(1 )f VPfVMfK 

Cf NP (1 )f VP 

Cf NP(i 5f VPf VM 

CfNP(l)fVPfK 



12 



6.7 4.2 EXPECTED FREQ. LESS THAN 5.0 



1 


4.0 




Cf NPC1 )f VPf VMfK 


115 


86.5 


9.4 


VMf NP(1 )f VP 


1 ?3 


178.8 


.2 


VMf NP(l)f VP+VM 


6 


10.0 


1.6 


VMf NP(1 )f VPfK 


2 


20.7 


16.9 


VMf NP(l)f VPfVMfK 


20 


23.7 


.6 


VPf NP(2) 


6 


1.8 




VPf NP(2)fK 



7 5.8 



.3 EXPECTED FREQ. LESS THAN 5.0 



1 

18 

1 



16.2 14.3 

4.7 
.4 



VPfNP(2)fVM 
VMf VPf NP (2) 
VMf VPf NP(2)fK 



19 



5.1 38.4 EXPECTED FREQ. LESS THAN 5.0 



O 

ERIC 



4 

46 

75 

54 

0 



3.2 

53*9 

111.4 

12.9 

2.6 



VMf VPfNP(2)fVM 
1.2 VP 
11.9 VP+VM 
131.1 VPfVMfK 

VMf VP+VMfK 



5.8 .5 



4 



EXPECTED FREQ. LESS THAN 5.0 



00117 




TABLE 42 (continued) 



3 

ERIC 



32 


6.2 


106.4 


1 


1.2 




2 


10.7 


7.1 


11 


22.1 


5.6 


13 


13.0 


0 


5 


5.0 


»o 


7 


7.0 


.0 


2 


9.3 


5.7 


12 


6.1 


5.8 


8 


6.3 


.4 


3 


.7 




0 


4.1 




4 


6.1 


.7 


2 


.5 




9 


5.3 


*/\ 

• 

CO 


9 


3.5 




9 


11.0 


.4 


0 


7.2 


7.2 


4 


10.7 


4.2 


8 


7.0 


.1 


€ 


14.5 


5.0 


24 


22.1 


.2 


12 


2.1 




23 


6.1 


47.3 


4 


1.4 




1 


2,9 




€ 


4.4 




11 


8.6 


.6 


0 


2.4 




0 


5.0 




0 


7.4 


7.4 


17 


4.9 




6 


10.2 


1.7 


0 


.5 




17 


5.4 


24.8 


0 


1.0 




4 


1.0 




0 


2.0 




23 


23.3 


.0 


5 


9.1 


1.8 


3 


1 .8 





VP+K 
VMf VP+K 
VMf VP 
VPH-VP+VM 

NP< I 3+VP+VM+NPC23 
NP<13+VMfVP+VM 
C+VM+NPCI 3+VP 

VP+C+VP 

VP+C+ VP+NP <23 

VP+C+VP+VM 

VP+C+VP+K 

VP+C+ VP+NP (23 +VM 

'EXPECTED FREQ. LESS THAN 5.0 

VP+C+ VP+NP <2)+K 

NP(1)+ VP+C+VP 

NP< 1 ) + VP+C+VP+NP<2) 

VP< 1 3+VP+C+VP+VM 

NP < I 3 + VP+C+ VP+ NP <2 3 + VM 

NP<1 )+C+NP(! )+VP 

NP < 1 )+C+NP Cl 3+VP+NP (2) 

NP < 1 ) +C+ NP C 1 ) +VP+ NP (2 )+ VM 
NP < 1 3+C+NP < 1 3+VP+VM 
VM+ NPC1 3+C+NP <1 )+VP 

EXPECTED FREQ. LESS THAN 5.0 

VMf NP < 1 3+C+NP ( I 3+VP+NP <23 
VMf NP < I 3+C+NP ( 1 3 + VP+NP <2 3+ VM 
VMf NP < 1 3+C+NP ( 1 3+VP+VM 

EXPECTED FREQ. LESS THAN 5.0 

VP+NP <2 3+C+NP <23 
VP+NP <23+C+NP<23+VM 

EXPECTED FREQ. LESS THAN 5.0 

NP < 1 3+ VP+NP <2 3 +C+ NP <2 3 
NP <13+ VP+NP C23+C+NPC23+VM 
VMf VP+NP C23+C+NP <23 

EXPECTED FREQ. LESS THAN 5.0 

VMf VP+NP (23+C+NP <23+ VM 
VMf NP Cl 3+VP+NP C23+C+NP <23 
VMf NP < 1 3+VP+NP <2 3+C+NP C23+VM 



NP< 1 3+VP+A 
NP<13+VP+VMfA 
NP< 1 3+VP+A+K 



00118 



TABLE 42 (continued) 



114 



7 


3.7 


.3 


EXPECTED FREQ. LESS THAN 


5.0 


0 


.7 




NP<l>+VP+Vl1tA+K 




15 


15.9 


.1 


NPCD+VP+A+VM 




11 


6.2 


3.7 


NP ( 1 )+VP+VMfA+VM 




5 


5.0 


.0 


VP+VPH-A+K 




1 


2.3 




NPCD+VP+A+O+VP 




6 


4.7 




NP<n+VP+A+0+VP+VM 




7 


7.7 


.1 


EXPECTED FREQ. LESS THAN 


5.0 


12 


4.7 




VP+NPC2J+VP 




11 


9.6 


.2 


VP+ NP(2)+VP+VM 




6 


9.5 


1.3 


NP < 1 )+VP+NP <2)+VP 




19 


19.6 


.0 


NP < 1 )+VP+NPC2)+VP+VM 




0 


.9 




V«f VP+NPC2J+VP 




12 


5.6 


7.4 


EXPECTED FREQ. LESS THAN 


5.0 


0 


1.9 




VMf VP+NP C2)+VP+VM 




0 


1.9 




Vttf NPCD+VP+NP <2)+VP 




4 


3.9 




VMf NPCD+ VP+NP (2)+VP+VM 




4 


7.7 


1.8 


EXPECTED FREQ. LESS THAN 


5.0 


6 


4.0 




VP+NP <2)+VP+NP<2) 




10 


8.2 


.4 


VP+NP(2)+VP+NP(2)+VM 




9 


8.1 


.1 


NP C 1 )+ VP+NP C2)+VP+NP <2 ) 




12 


16.7 


1.3 


NP C l )+VP+NPC2)+VP+NP <2)+VPI 


0 


2.2 




VP+NPC2J + NPC1 )+VP 




6 


6.2 


.0 


EXPECTED FREQ. LESS THAN 


5.0 



0 

0 

6 



3 

1 

5 



3 

0 

7 

0 

2 



ERIC 



.7 

.4 

4.5 

5.6 

1.5 
.7 

3.7 

6.0 

1.2 

.6 

7.6 
2.5 
1.3 

5.7 



VP+NPC2)+NPC1)+VP+VM 
VP+NP C2)+NP <1 )+VP+0+VP 
NP ( I )+VP+NPC2)+NPCl >+VP 

.0 EXPECTED FREQ. LESS THAN 5.0 

NPCn+VP+NPC2)+NPCl)+VP+VM 
NP C 1 )+VP+NP C25+NPC 1 >+VP+0+VP 
VP+RPC25+NP Cl )+VP 

1.5 EXPECTED FREQ. LESS THAN 5.0 



.0 



.1 



VP+RP C2)+NP ( 1 J+VP+VM 
VP+RP C25+NP ( 1 >+VP+0+VP 
NPC1)+VP+RPC2)+NPC1)+VP 
NP( 1 )+ VP+RP <2)+NP Cl J+VP+VM 
NP ( 1 )+ VP+RP (2)+NP <1 )+VP+0+VP 



EXPECTED FREQ. LESS THAN 5.0 



115 



00119 



. .. ; ■' * ■ 

yvj iji jtj 



TABLE 1+2 (continued) 



f ) 


8 


21.2 


8.2 


NP<l) + VP+0+VP 


1 


4! 


13.9 


52.9 


NP(l)+VP+0+VP+NP (2) 


l... 


17 


28.7 


4.8 


NP(1 )+VP+0+VP+NP <2)+VM 






41 


43.8 


.2 


NP<l)+VP+0+VP+VM 






7 


4.2 




VWf NP< 1 )+VP+0+VP 






5 


2.8 




VWf NP < 1 ) + VP+0+VP+ HP <2 ) 


; 




12 


7.0 


3.7 


EXPECTED FREQ. LESS THAN 5.0 


; 




0 


5.7 


5.7 


VMf NP < 1 ) + VP+O+VP+NP <2 ) + VM 






10 


8.7 


.2 


VHf NP ( 1 )+ VP+O+VP+VPI 






11 


6.8 


2.5 


NP(l)+VP+NP(2)+0+VP 






6 


2.6 




NP(l)+VP+NP<2)+0+VP+NP<2) 




0 


.0 




NP(l)+VP+NP<2)+0+VP+NP(2)+VP 


; 




10 


14.2 


1.2 


NP < l >+VP+NP <2>+Q+VP+VM 




L ■ 


2 


5.4 


2.1 


NP( 1 )+VP+NP <2)+0+VP+NP <2)+VM 




0 


.0 




NP < 1 >+VP+NP <2>+0+VP+NP <2)+VP+ VM 


i i 


[ 


5 


5.0 


.0 


NP ( 1 >+C+ NP < 1 )+VP+0+VP+ VM 


i 


j 


5 


5.0 


.0 


NP< 1 )+VP+VMf O+VP 




3 


3.0 




NP < 1 )+ VP+O+VP+NP <2)+VP 


; i 


0 


9 


5.6 


2.1 


EXPECTED FREQ. LESS THAN 5.0 


! i 




5 


5.0 


«o 


NP < 1 ) + VP+C ON+O+VP 


[•• ! 


1 : 


0 


2.6 




NP < 1 )+VP+CON+NP < 1 ) 




3 


2.2 




NP( 1 )+VP+CON+NP (l )+VP 


f 1 


[i 


5 


1.2 




NP< 1 )+VP+CON+NP <1 )+VP+VM 


r I 




8 


5.9 


.7 


EXPECTED FREQ. LESS THAN 5.0 


;■ | i 


0 


1.7 




NP( 1 )+VP+NP <2)+C0N+NP < 1 ) 


; lj 


0 


1.4 




NP < 1 ) + VP+N P <2 ) +C ON+NP < 1 ) +VP 


j 


0 


.8 




NP < 1 )+VP+NP(2)+C0N+NP < 1 )+VP+ VM 




['! 


8 


5.3 


1.4 


NPC 1 )+VP+VMfCON+NP<l ) 


I i 


! i 


8 


4.5 




NP < 1 )+VP+VMtCON+NP(l )+VP 


5 | 


[ l 


8 


8.4 


.0 


EXPECTED FREQ. LESS THAN 5.0 


; ! 


Lj 


1 


2.4 




NPC I ) + VP+VMfCON+NP(l )+VP+VM 


; 


r 


5 


3.5 




NP(l)+VP+NP<2)+VMtC0N+NP(l) 




1] 


6 


5.9 


.0 


EXPECTED FREQ. LESS THAN 5.0 






0 


2.9 




NP< 1 )+VP+NP(2)+VMfC0N+NP <1 ) + VP 






0 


1.6 




NP(l)+VP+NP<2)+VWtC0N+NP(l)+VP+VM 






- o 


4.5 




RESIDUAL 


0 

FRir 


2222 


2222.0 


729.2 


TOTAL 


L i\l v> ■ 






44 


DEGREES OF FREEDOM 



116 



TABLE k-2' (continued) 



00120 



ALL COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


653 


668.1 


.3 


31 


54.6 


10.2 


363 


324.3 


4.6 


47 


45.3 


.1 


1 


3.7 




23 


22.0 


.0 


88 


81.0 


.6 


1 


6.6 


4.8 


38 


39.3 


.0 


108 


223.8 


59.9 


601 


354.0 


172.4 


3 


54.7 


48.9 


7 


86.5 


73.1 


20 


6.5 


27.7 


1 


10.3 


8.4 


0 


1.6 





SOURCE 

NP< 1 )+VP+NP (2) 
NPd)+VP+NP<2)+K 
NPd)+VP+NP<2)+VW 
C+NPd)+VP+NP<2> 
C+NPd )+VP+NP(2)+K 
C+NPd >+VP+NP<2)+VM 
VWf NP < 1 )+ VP+NP C2) 

VMf NPd ?+ VP+NP (2>+K 

VMf NPdJ+ VP+NP (2>+VM 

NPd>+VP 

NP < 1 )+VP+VM 

NP ( 1 )+VP+K 

NPd)+VP+VM+K 

R+NPd)+VP 

R+NPd>+VP+VM 

R+NPd>+VP+K 



1 



5.3 3.5 EXPECTED FREQ. LESS THAN 5.0 



0 

ERIC 



0 


2.5 




26 


27.1 


.0 


59 


42.8 


6.1 


0 


6.6 


6.6 


2 


10.5 


6.9 


268 


164.0 


65.9 


240 


259.5 


1.5 


17 


40.1 


13.3 


2 


63.4 


59.5 


124 


126.9 


• 1 


34 


10.4 


53.9 


27 


61.6 


19.4 


18 


12.9 


2.0 


3 


1.1 




13 


6.3 


7.3 


95 


214.5 


66.6 


253 


339.2 


21.9 


221 


82.9 


230.0 


3 


8.4 


3.5 


156 


52.4 


204.7 


4 


5.3 


.3 


4 


21.8 


14.5 


23 


34.4 


3.8 


13 


13.0 


• 0 


7 


7.0 


.0 


9 


9.0 


.0 


28 


52.5 


11.4 


72 


38.2 


30.0 


25 


25.5 


.0 



R+ NP d ) + VP+ VK+K 
C+NPd >+VP 
C+NPd )+VP+VM 
C+NP d )+VP+K 
C+ NP d ) + VP+ VWf K 
VWf NP(1)+VP 
VttfNP(!)+VP+VM 
VMfNP(l)+VP+K 
VM4- NP < 1 ) + VP+ VMf K 
VP+NP<2> 
VP+NP(2)+K 
VP+NP<2)+VM 
VWf VP+NP(2) 

VMf VP+NP(2)+K 
VWf VP+NP <2)+VM 
VP 

VP+VW 

VP+VWfK 

VWfVPfVWfK 

VP+K 

VWfVP+K 

VMf VP 

VWfVP+VM 

NP < 1 )+ VP+VMf NP (2) 
NP < 1 )+VMf VP+VM 
C+VWf NP(1 )fVP 

VP+C+VP 
VP+C+ VP+NP <2) 
VP+C+ VP+VM 
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8 


4.3 




VP+C+VP+K 


11 


7.9 


1.3 


EXPECTED FREQ. LESS THAN 5.0 


3 


18.5 


13.0 


VP+C+ VP+NP(2)+VM 


6 


3.1 




VP+C+ VP+NP <2 )+K 


23 


11.2 


12.4 


NPd )+VP+C+VP 


13 


8.2 


2.9 


NPd )+VP+C+ VP+NP <2) 


13 


17.7 


1.3 


VP(1 ) + VP+C+VP+VFl 


1 


12.9 


1 1 .0 


NPd )+VP+C+VP+NP<2>+VM 


10 


23.6 


7.8 


NP ( 1 )+C+NP ( 1 )+VP 


19 


17.2 


.2 


Np < 1 ) +C+ NP (1 ) +VP+ NP (2 ) 


12 


27.2 


8.5 


NPC1 )+C+NP(l )+VP+NP(2)+VM 


39 


37.3 


.1 


NP( 1 )+C+NPd )+VP+VM 


21 


2.4 




VMt NPd )+C+NP < 1 )+VP 


27 


5.5 


83.7 


EXPECTED FREQ. LESS THAN 5.0 


4 


1.7 




VMt NP d )+C+ NP < 1 )+ VP+NP (2) 


2 


2.8 




VMt NP < 1 >+C+ NP d ) + VP+NP <2 > + VM 


9 


3.8 




VMtNPd)+C+NPd)+VP+VM 


15 


8.3 


5.4 


EXPECTED FREQ. LESS THAN 5.0 


15 


10.0 


2.5 


VP+NP C2)+C+NP<2) 


0 


15.8 


15.8 


VP+NP (2)+C+NP (2)+VM 


33 


11.5 


40.4 


NP( t ) + VP+NP (2)+C+NP <2) 


6 


18.2 


8.1 


NP Cl )+VP+NP(2)+C+NP(2)+VM 


0 


1.0 




VMt VP+ NP<2)+C+NP (2) 


1 


1.6 




VMt VP+ NP <2 ) +C+ NP <2 )+ VM 


6 


1.2 




VMt NP(1)+VP+NP(2)+C+«P<2) 


0 


1.8 




VMtNP <1 )+VP+NP (2)+C+NP<2)+VM 


7 


5.6 


.3 


EXPECTED FREQ. LESS THAN 5.0 


62 


63.2 


.0 


NP(1)+VP+A 


10 


14.0 


1.2 


NP C 1 )+VP+VM+A 


6 


5.2 


.1 


NP ( 1 > + VP+A+K 


0 


1.1 




NP ( 1 ) + VP+ VMt A+K 


31 


30.7 


.0 


NP( 1 )+VP+A+VM 


12 


6.8 


3.9 


NP(1) + VP+VM+A+VM 


5 


5.0 


.0 


VP+VMtA+K 


7 


7.7 


.1 


NP ( 1 )+ VP+A+O+VP 


13 


12.3 


.0 


NP(1 ) + VP+A+0+VP+ VM 


42 


20.4 


22.8 


VP+NP (2)+VP 


32 


32.3 


.0 


VP+NP (2)+VP+VK 


15 


23.5 


3.1 


NP(1 )+VP+NP(2)+VP 


29 


37.2 


1.8 


NP(1 ) + VP+NP (2)+VP+VH 


0 


2.1 


i 

1 


VMt VP+NP C2J+VP 


0 


3.3 


1 

i 


VMt VP+NP (2)+VP+VM 


0 


6.5 


6.5 


EXPECTED FREQ. LESS THAN 5.0 
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O 

ERIC 



0 


2,4 




VM+NPO 5+VP+NP (25+VP 


7 


3.8 




VM+NPO 5+VP+NP (25+VP+VM 


7 


6.2 


.1 


EXPECTED FREQ. LESS THAN 5.0 


18 


11.2 


4.2 


VP+NP (25+ VP+NP (2) 


15 


17.7 


.4 


VP+NP (2 5 + VP+NP (25+VM 


15 


12.9 


.4 


NPd 5+VP+NP (25+VP+NP (25 


14 


20.3 


2.0 


NPd 5+VP+NP (25+VP+NP (25+VM 


21 


24.6 


.5 


VP+NP (2) + NP(l) + VP 


2 


5.5 


2.2 


VP+NP(25 + NP (1 5+VP+VM 


0 


2.0 




VP+ NP (2 ) + NP ( 1 ) + VP+ 0+ VP 


39 


28.3 


4. 1 


NP( 1 )+VP+NP(2)+NP(l )+VP 


5 


6.3 


.3 


NP ( 1 5+VP+NP (25+NP ( 1 5+VP+VM 


2 


2.4 




NP ( 1 5+VP+NP (25+NP ( 1 5+VP+O+VP 


5 


8.9 


1.7 


VP+RP (25 + NP (15 + VP 


8 


2.0 




VP+RP (25 + NP (15+VP+VM 


10 


6.4 


2.1 


EXPECTED FREQ. LESS THAN 5.0 


0 


.7 




VP+RP (25 + NP (1 5+VP+O+VP 


7 


10.2 


1.0 


NPd 5+VP+RP (25+NP (15+VP 


1 


2.3 




NP ( 1 5+VP+RP (25 +NP ( 1 5+ VP+VM 


4 


.9 




NP ( 1 5+VP+RP (25+NP ( 1 5+VP+O+VP 


33 


59.6 


11.9 


NP(1 5+VP+O+VP 


92 


43.4 


54.5 


NP(15+VP+0+ VP+NP (25 


31 


68.6 


20.6 


NP( 1 5+VP+0+ VP+NP (25+VM 


110 


94.3 


2.6 


NP ( 1 5+VP+0+ VP+VM 


9 


6.1 


1.4 


VM+NPO 5 + VP+O+VP 


7 


4.4 




VM+ NP ( 1 5 + VP+0+ VP+NP ( 2 5 . 


12 


8.3 


1.7 


EXPECTED FREQ. LESS THAN 5.0 


0 


7.0 


7.0 


VMtNP(15+VP+0+VP+NP (25+VM 


11 


9.6 


.2 


VPH- NP ( 1 5 + VP+O+VP+VM 


25 


17.0 


3.7 


NP(1 5+VP+NP (25+0+VP 


21 


10.1 


11.9 


NP(1 5+VP+NP (25+O+VP+NP (25 


0 


.0 




NP ( 1 5+VP+NP (25+O+VP+NP (25+VP 


19 


27.0 


2.3 


NP( 1 5+VP+NP (25+0+ VP+VM 


5 


15.9 


7.5 


NP ( 1 5+VP+NP (25+O+VP+NP (25+VM 


0 


.0 




NPd 5+VP+NP (25+0+VP+NP(25+VP+Vl j ' l l 


5 


5.0 


.0 


NP ( 1 5+C+NP ( 1 5+ VP+O+VP+VM 


5 


5.0 


.0 


NP ( 1 5+VP+VM+O+VP 


9 


9.0 


.0 


NP(15+VP+0+ VP+NP (|5+ VP 


5 


5.0 


.0 


NP ( 1 5+ VP+C 0N+0+ VP 


0 


2.9 




NP(15+VP+C0N+NP(H 


5 


2.9 




NPd 5+VP+CON+NP (1 Ji+VP 


5 


3.8 


.1 


EXPECTED FREQ. Lfcisfi THAN 5.0 


6 


1.6 




NP ( 1 5+VP+CON+NP ( 15+VP+VM 


0 


2.1 




NP d 5+VP+NP (25+COM+NP ( 1 5 


0 


2.1 




NP ( 1 5+VP+NP (25+COfl+NP (15+VP 
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TABLE k2 (continued) 






6 


5.8 


.0 


EXPECTED FREQ. LESS THAN 5.0 


o 


1 .1 




NP<1 5+VP+NP (25+CON+NP <1 5+VP+VM 


8 


4.6 




NP ( 1 >+ VP+VPH-CON+NP < 1 ) 


8 


5.8 


.9 


EXPECTED FREQ. LESS THAN 5.0 


8 


4.6 




NP < 1 >+VP+VMfCON+NP< 1 )+VP 


1 


2.5 




NP < 1 5+VP+ VMfCON+NP < 1 5+VP+VM 


9 


7.1 


.5 


EXPECTED FREQ. LESS THAN 5.0 


5 


3.4 




NP< 1 5+VP+NP(25+VM+CON+NP< 1 ) 


0 


3.4 




NP < 1 5+VP+NP <25+ VM+CON+NP < 1 5+VP 


5 


6.7 


.4 


EXPECTED FREQ. LESS THAN 5.0 


0 


1.8 




NP < 1 5+VP+NP (25+VttfCON+NP < 1 5+VP+VM 


0 


2.0 




RESIDUAL 


4822 


4822.0 


1644.0 


TOTAL 






61 


DEGREES OF FREEDOM 



I 



I 

I: 

r 

;■ 

i 

i 



i 

i 

j 

! 

i 

i 

f 

i 

f 
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Within the pre-primers combined., seven of the ten highest ranking 
types of the Ginn series are among the ten highest ranking types of the 
Scott-Foresman series, and within the primers and readers eight of the ten 
highest ranking types are common to botho In the primers the first three 
high frequency types hold corresponding ranks in the Ginn and Scott-Foresman 
books, and in the readers the first six high frequency types hold corresponding 
ranks. The type (NP. . n +VP. .+NP 0 ) is the most frequent pattern in the 

pre-primers and the primers and is the second most frequent pattern in the 
readers,' in the readers the type (NP. . ,+VP. ,+VM) is most frequent. For 
all sections the majority of high frequency types can be generated from 
the rules for statements with simple subjects and predicates and without 
embedaingo 

The maximum-likelihood estimates for the rule choice parameters, 

A l" A 2V inciicai:e some changes in structural patterns from the pre-primers 
to the readers. Table 44 shows the rank order of the corresponding 
estimated values across sections; "l" indicates the largest value and "3" 
indicates the smallest. 

Insert Table 44 about here 

Within the statements without embedding there is a sharp decrease in 
the estimates for Ay A^, Ag, and A^ from the pre-primers to the readers. 

These are probabilities of statements which are (or in the case of A^ 
may be) imperatives, and include one and two word utterances such as 
"Run." and "Help Dick." The proportion of statements which include 
subjects or objects or both increases from the pre-primers to the readers, 
and with the exception of A^ the estimates for the probabilities of 
statements with embedding are smallest for the pre-primers. Again A^ 
is the parameter for a rule which may be an imperative. Estimates for 
B8 and B8 2 , the parameters for including or deleting the subjective noun 
phrases, indicate that for the pre-primers the subject is deleted twice 
as often as not; thus when rule 13 with probability A^ or rule l4 with 
probability A^ is chosen for the pre-primers, the subjective noun phrase 
is usually deleted and the result is an imperative statement. Contrary 
to the Strickland finding and more in line with common- sense expectations, 
some evidence for a development of sentence structure was found in the 
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TABLE 14 

Rank Order of the Maximum-, dkelihood Estimates, 
A l” A 2V across Sections of the Corpus 



Parameter 


Pre- Primers 
Combined 


Primers 

Combined 


Readers 

Combined 


A i 


3 


2 


1 


A 2 


3 


2 


1 


*3 


1 


2 


3 


\ 


1 


2 


3 


A 5 


2.5 


2.5 


1 


A 6 


3 


2 


1 


*7 


2 


3 


1 


A 8 


1 


2 


3 


A 9 


3 


1 


2 


A io 


3 


2 


1 


A_ . 


1 


3 


2 


11 


A 12 


3 


1 


2 


A i3 


2-5 


2-5 


1 


a i4 


3 


1 


2 


A i5 


1.5 


1.5 


3 


a i6 


3 


2 


1 


A_ _ 


2 


1 


3 


If 


A i8 


3 


1 


2 


VO 


3 


1 


2 


A 20 


2.5 


2.5 


1 


A 21 


2.5 


2.5 


1 


A 22 


3 


1 


2 
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corpus; short statements without embedding were more probable in the 
pre-primers while longer statements and statements with embedding were 
more probable in the readers . 

Table 42 indicates that a large percent of each of the total chi- 

squares is due to a few types which contribute very large chi-square 

values, rather than to many types with rather large chi-square contributions. 

For the corpus as a whole, for example, subtraction of the ten largest 

chi-square contributions yields a total chi-square of 573«7> a reduction 

of 1070.3 or 65 .I#. Thus, in general, the grammar provided a good fit; 

again, however, the number of degrees of freedom is relatively small, and 

some sort of fit would be expectedo Only one type, (NP. . _+VP. ,+VM), 

tjJji iyJ 

was a consistently large contributer; predictions for this type were always 
too low. This type can be derived from the seconc rewrite rule; removing 
this possibility from the rule and forming instead a new rule in hopes of 
changing the parameters and providing a better fit for this type would 
involve changing the whole structure of the rule and including several 
more rules. This in turn might change some of the good predicted frequencies 
of other types derivable from tne rule and in this way again increase the 
total chi-square value. Thus it seemed best to make no change in the grammar. 

Figure 6 shows the fit of the grammar tc the observed frequencies 
and their rank order. This indicates a fit which is poorer than the 
previous grammars for high frequency types but is in general quite good. 

Insert Figure 6 about here 

Table 45 summarizes the total chi-squares and the respective degrees 
of freedom for each of the six grammars for every section of the corpus 
,xn which the statistic was applicable. Table 46 shows the ’'average 
chi-squares” found by dividing the degrees of freedom into the total chi- 
square; these values form the numerators and denominators of the F-tests 
and make pairwise comparisons somewhat easier. 

Insert Tables 45 and 46 about here 

The fits given by the statements- with- verbs grammar for the primers 
individually and combined and for the Ginn reader were not significantly 
different than those given by the noun-phrase grammar for corresponding sections 
The fit for the pre-primers combined was signf rtantly worse and the fits for 
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Fig 6- Comparison of observed and predicted values for statement with verbs (entire corpus). 
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All Combined 316.8 2 159-1 4 9-6 5 13-0 3 52-3 2 1644.0 6.1 
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the Scott -For esman reader, the readers combined, and the corpus as a whole 
were significantly better than those of the noun-phrase grammar; the latter 
were equivalent to those given by the verb-phrase grammar «, 

All of the phras° -structure grammars presented in this chapter have 
been written to fit the corpus as nearly as possible. Only a few syntactic 
patterns which do not appear in the corpus but are generally considered 
grammatical can be generated by the grammars, and no provision has been 
made for unbounded embedding. In later uses, for example an making 
comparisons with oral speech, more general grammars might be preferable; 
in that case the same grammar would be used for both corpuses but different 
parameters would be estimated. Although more general constructions and 
unbounded embedding are possible in phrase- structure grammars, they are 
somewhat easier to conceptualize in categorial grammars , With this in 
mind, categorial grammars were written for the noun phrases and verb phrases 
these are presented in the next chapter. 
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CHAPTER IV 
CATEGORIAL GRAMMARS 

For purposes of comparison, two categorial grammars, one for the 
noun phrases and one for the verb phrases, were constructed. It has been 
shown (Bar-Hillel, Gaifman & Shamir, i960) that phrase- structure grammars 
and categorial grammars are equivalent in the range of languages they are 
capable of characterizing » Categorial grammars, while less easy to interpret 
than phrase- structure grammars, are more easily adapted to fit other 
corpuses (for example oral language) because they contain at most two 
rewrite rules,, To adapt a categorial grammar to a new corpus the 
parameters must be reestimated and sometimes a few new categories must 
be added, but no new rewrite rules are needed. 

The two rewrite rules for standard categorial grammars are:: 

1) a -» a/p , p 

2) a -» p, p\a 

where a and (3 are categories. In writing a categorial grammar 
a finite number of primitive categories is selected; the primitive categories 
are usually taken to be ”s ,? for sentence and ,s n :j for noun. All primitive 
categories are categories, when when a and (3 are categories, [Gi/(3] and 
[a\(3] are categories~-these are called derived categories. All words in 
the terminal vocabulary are classified into one or more of these categories * 

A premise ( l? s* in standard categorial grammars) is chosen, and all 
grammatical types of utterances must be derivable from the premise, the 
rewrite rules, and the categories. Any utterance derivable in this way 
is considered grammatical, so the categories must be established so that 
all grammatical utterances, and only those, can be derived- The following 
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example illustrates the method of derivation in a categorial grammar. 



transitive verbs. Then the categorial symbolism for the syntact.ical 
type (N+TV+N) is (n, [ri\s]/n, n) , and this can ba generated as follows: 



Corresponding to the phrase structure grammars , parameters can be 
assigned to each choice point of the derivations, and the same statistical 
procedures for evaluating the grammars may be followed . In general fcur 
types of parameters are necessary for categorial grammars: a stopping 

parameter, a parameter for the choice of rule used, parameters denoting 
substitutions for (3, and parameters denoting substitutions for a. 

The first three parameter types represent unconditional probabilities 
and present no problem. The parameters for the choice of a, however, are 
conditional probabilities; they are conditional on the choices available 
from the previous application of the ruL*. In the example above, when rule 
2 is applied, the only choice for a is ,! s" so the probability of that 
choice is one, when rule 1 is applied in the next step, there are two 
choices available for a, "n n and "n\s ,f , and the probability of choosing 
"s" at this point is zerOo In this example three parameters for Ot are 
needed: P[s for a given s] , P[n for a given n, n\s], and P[n\s for a 
given n, n\s] ; clearly the first probability is one and the sum of the 
last two probabilities is one. Under this system, a new set of Ot parameters 
is required each time a new set of choices is available. Some 
simplification can be achieved by considering the number of the choice 
instead of the choice itself. In the example above the required parameters 
would be: P[ first choice given one choice], P[first choice given two 

choices], and P[ second choice given two choices]; again the first 
probability is equal to one and the sum of the last two is one. When the 
number of the choice is the basis for parameter assignment, the number of 
necessary sets of a parameters is equal to the maximum 

number of times the rule is applied in any derivation of the grammar. If 
the grammar provides for an infinite number of rule applications, a 
decision could be made to stop when the probability of syntactical patterns 



Let "n M be the category for nouns and [n\s]/n l s the category for 



s -> n, n\s 

n, n\s n, [n\s]/n, n 



Rule 2, s for a, n for (3 
Rule 1, [n\s] for a, n for (3 



I 
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reached a certain level close to zero; the maximum number of rule 

applicat ions would then be finite- In the noun-phrase and verb-phrase J 

grammars of this chapter, no choice of Of is available, so the problem of 

conditional parameters does not arise . j 

Categorial Grammar for Eoun Phrases 

i 

The categorial grammar for noun phrases is summarized in Table 47, i 

and the derivations and theoretical probabilities are given in Table 48. 

Insert Tables 47 and 48 about here 

As can be seen from the derivations, only one rewrite rule is 
required for the noun- phrase grammar, so no parameter is necessary for 
the choice of the rewrite rule* The only choice made for oc and (3 is 
"n", so no parameters are required for these choices either * 

Three premises have been selected, ? ’n", J, p", and "g Tt , but derivations 
are permitted from only the first of these . This is a deviation from 
standard categorial grammars in which only one premise is chosen ard all 
derivations are made from the one premise. But such a deviation was 
necessary to correspond to the phrase- structure grammar in not allowing 
adjectives and articles to precede pronouns and proper nouns. Three 
parameters were required for the three premises; is the probability of 
choosing "n" for the premise, A^ is the probability of choosing !i p" , and 
A^ is the probability of choosing :, g'\ 

The optional transformation requires one (free) parameter; is 
the probability of choosing an article and T^ = 1-T^ is the probability of 
not choosing an article. This method of generating types such as (T+A+N) 
as well as (A+N) was the simplest which could be found 0 Other means, such 
as classifying articles as adjectives, generated phrases with the 
article between the adjective(s) and noun (for example, "red and yellow 
the ball" instead of "the red and yellow ball") as well as the desired 
syntactical types. As Table 47 shows, the same obligatory transformations 
used for the phrase-structure grammars are used here. 

The variable "S" was chosen for the stopping parameter; this is a 
i /N\ x 

binomial parameter, \x)S^ (l-S^) , where x is the number of times the 

rewrite rule was applied and "N" is chosen to be some number greater than 
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TABLE k7 

The Categorial Grammar for Noun Phrases 



Rewrite rule: a ->Cx/P, P 

Premises from which derivations are possible: n 

Premises from which no derivations are possible: p, g 

Primitive categories: n 

Categories: n (noun) 

g (proper noun) 

p (pronoun) 

t (article) 

n/n (adjective) 

Obligatory transformations : 

a) If ,f n" is ,f something ,f , n/n, n -> n,n/n 

b) If n n/n ,f is "what” or "all", t,n/n,n n/n,t , n 

c) If "n/n^" is "what" or "all", t,n/n,n/n ,n n/n ,t,n/ngjn 

Optional transformation: Any statement derivable from M n H may begin with 

an article, rr t n . 

premise choice parameters 

stopping parameter 

optional transformation parameter 



Parameters : 



A i ~ k y 

W 

W 
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TABLE 48 



Derivations from the Categorial Grammar for Noun Phrases 



Type 


Derivation 


Theoretical 

Probability 


P 


g 


A 2 


G 


g 


A 3 


N 


n 


A l® S lV T 2 


A+N 


n -*n/n,n 


a 1 (N)s 1 1 s 2 n_1 t 2 


A+A+N 


n -»n/n,n -»n/n,n/n,n 


A 1 


T+N 


n + transformation 


A l(5) S lV T l 


T+A+N 


n~»u/n,n + transformation 


A l ( l )S l ls 2 N " lT l 


T+A+A+N 


n ~m/n,n -»n/n, n/n, n + transformation. 


A l(I)Si 2 S 2 N - 2 Ti 


A+A+A+l 


n -m/ n, n -m/ n, n/ n, n ~>n/ n, n/n> n/n } n 


A 1 ( 3 )S 1 5S 2 N " 5t 2 
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the maximum number of times the rule must be applied to generate all 
observed syntactical types. This a priori choice of f, N M is not entirely 
satisfactory, but as will be seen in a later discussion of chi-square 
contributions and total chi-squares, within a certain range the choice 
does not have a significant effect on the parameter estimation. If f, S^ n 
is estimated separately for typer formed according to the number of times 
the rule was applied, "S^ ,r is a maximum- likelihood estimate as shown in 
the following derivation for N =6, the value used in the application of 
the maximum- likelihood procedures as explained in Chapter III. 



Type 

1 

2 

3 

4 

5 

6 

7 



Frequency 
K 1 = f N +f T+N 
K 2 = f A+N +f T+A+N 
K 3 = f A+A+N +f T+A+A+N 



K, 



a+a+a+n 



\6 n K i 



v° 

k 6 = ° 
K ? = 0 
K„ 



Probability 

(l-sp 6 

6(i-s 1 ) 5 (s 1 ) 

15(i-s i ) 4 (s i ) 2 

20(l-S 1 ) 3 (S 1 ) 3 

15 (i-s 1 ) 2 (s 1 ) 4 

6(1-S 1 )(S 1 ) 5 

„ 6 



4 0 2-1 3 



\3 C 3n K 4 



L = [(1-S 1 ) U ] x [6(1-S 1 )^S 1 ] <i [l5(l-s i rsp] ^Op-S^Sp’] 



2 , 



log L = K 1 log[(l-S 1 ) u ]+K 2 log[6(l-S 1 ) ;? S 1 ]+K 5 log[l5(l-S 1 ) lt Sp]+ K^logpOp-S^Sj 3 ] 



2 IQ* L ~ 6K 1 
1-S, 



5Kg 



2Ki 



1-S, 



4K, 



1-S, 



3K4 



3K4 

i-s. 



^ s i "“i “l - “i “l - “i “i - “l 

Setting the derivative equal to zero and solving: 

K 2+ 2K 3+ 3K 4 

S 1 6>K +£)K +6K +6K. 

1-234 

Using the observed frequencies for the corpus as a whole (Table 50), 

= .1054. And this is the estimate obtained when the maximum- likelihood 
estimation procedures explained in Chapter III are applied. (Table 45) 

Insert Tables 49 and 50 about here 
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5259 .4185 .4332 .4582 .4438 .3959 -5425 .4657 .4555 .4555 -4555 -4555 



TABLE 50 



Observed and Expected Frequencies, Chi-Square Contributions, 
and Total Chi-Squares for each Section of the Corpus for the 
Categorial Grammar for Noun Phrases 



GINN PR E-PR I PIER 



OB SEP V. 


EXPECT. 


CHI**2 


SOURCE 


230 


230.0 


.0 


P 


254 


254.0 


.0 


G 


76 


70.6 


.4 


N 


23 


31.1 


2.1 


A+N 


ft 


5.7 


.9 


A+A+N 


121 


117.7 


.1 


T+N 


42 


51 .8 


1.8 


T +A 4 - N 


17 


9.5 


5.9 


T+A+A-rN 


1 


.6 




A+ A+A+N 


1 


1.6 




RESIDUAL 


772 


772.0 


1 1 .3 


TOTAL 



3 DEGREES OF FREEDOM 



SC0TT-F0RESMAN PR. -PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


424 


424.0 


.0 


P 


132 


132.0 


.0 


G 


33 


36.9 


.4 


N 


24 


19.4 


1.1 


A+N 


4 


4.2 




A+A+N 


36 


33.2 


.2 


T+N 


14 


17.5 


.7 


T+A+N 


5 


3.8 




T+ A+A+N 


9 


8.1 


.1 


EXPECTED FREQ 


0 


.5 




A+ A+A+N 


0 


1 .0 




RESIDUAL 


672 


672.0 


2.5 


TOTAL 



LESS THAN 5.0 




2 



DEGREES OF FREEDOM 
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TABLE 50 (continued) 



PRE-PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


654 


654.0 


.0 


P 


386 


386.0 


.0 


G 


109 


108.1 




N 


47 


50.2 


.2 


A+N 


12 


9.7 


.5 


A+A+N 


157 


150.3 


.3 


T+N 


56 


69.7 


2.7 


T+A+N 


22 


13.5 


5.4 


T+A+A+N 


1 


1 .0 




A+A+A+N 


1 


2.5 




RESIDUAL 


1444 


1444.0 


9.1 


TOTAL 






3 


DEGREES OF FREEDOM 





GINN 


PRIMER 




OBSERV. 


EXPECT. 


CMI**2 


SOURCE 


727 


727.0 


.0 


P 


286 


286.0 


.0 


G 


86 


130.4 


15.1 


N 


123 


107.3 


2.3 


A+N 


56 


36.8 


10.0 


A+A+N 


227 


170.7 


18.6 


T+N 


99 


140.4 


12.2 


T+A+N 


43 


48.1 


.5 


T+A+A+N 


17 


6.7 


15.7 


A+A+A+N 


0 


10.5 


10.5 


RESIDUAL 


1664 


1 664.0 


85.0 


TOTAL 



O 

ERIC 

iifflimiffaHaaaa 



5 



DEGREES OF FREEDOM 
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TABLE yC (continued) 



SCOTT-FORESMAN PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


784 


784.0 


.0 


P 


220 


220.0 


.0 


G 


63 


103.0 


15.5 


N 


110 


82.8 


8.9 


A+N 


43 


27.7 


8.4 


A+A+N 


153 


121 .8 


8.0 


T+N 


78 


97.9 


4.1 


T+A+N 


28 


32.8 


.7 


T+A+A+N 


3 


5.0 




A+ A+ A+ N 


3 


12.0 


6.7 


RESIDUAL 


1482 


1482.0 


52.3 


TOTAL 






4 


DEGREES OF FREEDOM 



PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


1511 


1511.0 


.0 


P 


506 


506.0 


.0 


G 


149 


233.3 


30.5 


N 


233 


190.1 


9.7 


A+N 


99 


64.6 


18.4 


A+A+N 


380 


292.5 


26.2 


T+N 


177 


238.3 


15.8 


T+A+N 


71 


80.9 


1.2 


T+A+A+N 


20 


11.7 


5.9 


A+ A+A+N 


0 


17.5 


17.5 


RESIDUAL 


3146 


3146.0 


125. 1 


TOTAL 



O 

ERIC 

ijfflimffamiaaa 



5 



DEGREES OF FREEDOM 



138 



00142 



rv 



TABLE 50 (continued) 



GINN READER 



OBSERV. 


EXPECT 


!! 43 


1 143.0 


A75 


473.0 


21 5 


296.3 


238 


210.6 


103 


62.4 


556 


452.1 


243 


321 .3 


86 


95.2 


24 


9.9 


0 


17.4 


00 

0 

1*} 


3081 .0 



CHI**2 SOURCE 



.0 


P 


.0 


G 


22.3 


N 


3.6 


A+N 


26.5 


A+A+N 


23.9 


T X N 


19.1 


T+A+N 


.9 


T+ A+A+N 


20.3 


A+ A+A+N 


17.4 


RESIDUAL 


133.9 


TOTAL 


5 


DEGREES OF 



FREEDOM 



SCOTT- FORESMAN READER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


997 


997.0 


.0 


P 


323 


323.0 


.0 


G 


239 


376.3 


50.1 


N 


370 


259.0 


47.5 


A+N 


97 


74.3 


6.9 


A+A+N 


430 


317.4 


40.0 


T+N 


148 


218.5 


22.7 


T+A+N 


31 


62.7 


16.0 


T+A+A+N 


16 


11.4 


1.9 


A+ A+A+N 


0 


11.4 


11.4 


RESIDUAL 


2651 


2651.0 


196.6 


TOTAL 



5 DEGREES OF FREEDOM 



O 

ERIC 

ijfflimffamiaaa 
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TABLE 50 (continued) 



READERS COMBINED 



| 





OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


i 

{ 


2140 


2140.0 


.0 


P 


796 


796.0 


.0 


G 




454 


671.5 


70.4 


N 


j 


60S 


470.1 


40.4 


A+N 


} 


200 


137.1 


28.8 


A+A+N 




986 


770.5 


60.3 


T+N 


; 


39! 


539.4 


40.8 


T+A+K 


1 


1 1 7 


157.4 


10.4 


T+A+A+N 




40 


21 .3 


16.3 


A+ A+A+N 


i 

.1 


0 


28.6 


28.6 


RESIDUAL 




5732 


5732.0 


296. 1 


TOTAL 


1 






5 


DEGREES OF FREEDOM 



I 



ALL COMM NED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


4305 


4305.0 


.0 


P 


1688 


1688.0 


.0 


G 


712 


101 1 .0 


88.4 


N 


888 


714.5 


42.1 


A+N 


311 


210.4 


48.1 


A+A+N 


1523 


1208.4 


81.9 


T+N 


624 


854.0 


61.9 


T+A+N 


210 


251 .5 


6.8 


T+A+A+N 


61 


33.0 


23.7 


A+ A+A+N 


0 


46.2 


46.2 


RESIDUAL 


1 0322 


10322 


399.3 


TOTAL 



5 DEGREES OF FREEDOM 



ERIC 

hfliflaffHMioaa 
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TABLE 50 (continued) 

ALL COMBINED CN=5) 



03SERV. 


EXPECT. 


CHI**2 


SOURCE 


4305 


4305.0 


.0 


P 


1688 


1688.0 


.0 


G 


712 


1003.1 


84.5 


K 


888 


726.0 


36, 1 


A+N 


311 


210.2 


48.4 


A+A+N 


1523 


1198.9 


87.6 


T+N 


624 


867,7 


68.5 


T+A+N 


210 


251 .2 


6.8 


T+A+A+N 


61 


30 o 4 


30.7 


A+ A+A+N 


0 


41 .4 


41 .4 


RESIDUAL 


10322 


10322 


403.9 


TOTAL 



5 DEGREES OF FREEDOM 



ALL COMBINED CN=7) 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


4305 


4305.0 


.0 


P 


1 688 


1688.0 


.0 


G 


712 


1016.5 


91 .2 


N 


888 


706.5 


46.6 


A+N 


311 


210.5 


48.0 


A+A+N 


1523 


1215.0 


78.1 


T+N 


624 


844.4 


57.5 


T+A+N 


210 


251 .5 


6.9 


T+A+A+N 


61 


34.8 


19.7 


A+ A+A+N 


0 


49.6 


49.6 


RESIDUAL 


10322 


10322 


397.7 


TOTAL 



5 DEGREES OF FREEDOM 
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TABLE 50 (continued) 



ALL COMBINED <N=8> 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


4305 


4305.0 


.C 


P 


1688 


1688.0 


.0 


G 


712 


1020.6 


95.5 


N 


888 


700.6 


50.1 


A+N 


31 1 


210.4 


48.1 


A+A+N 


1523 


1219.9 


75.3 


T+N 


624 


837.4 


54.4 


T+A+N 


210 


251.5 


6.9 


T+A+A+N 


61 


36.1 


17.1 


A+ A+A+N 


0 


52.2 


52.2 


RESIDUAL 


10322 


10322 


397.5 


TOTAL 



5 DEGREES OF FREEDOM 




00146 



142 

If had been chosen to be a geometric distribution, no a priori 
choices would have been necessary.. This distribution was effective in 
the work of Suppes (1970), but it would provide a poor fit in this case. 

If the geometric distribution had been used here, the parameter S 1 would 
have appeared in every theoretical probability involving "N ,? , and 
whenever one or more adjectives were included, a corresponding number 
of = 1 ” S ]_ terms would have appeared in the probability. The probability 
of type (N), for example, would have been: (A^) (b^) (l-T^) and the proba- 
bility of type (A+N) would have been: (A^) (l-S^) (3^j (i-T^) * Thus the 

grammar would always predict a greater number of (Iff) types than (A-rN) 
types, a greater number of (A+N) types than (A+A+N) types, etc. But 
the observed data did not show this trend. The Poisson distribution 
would have fit the stopping date in a manner similar to the binomial 
distribution and would not have involved an a priori choice; but the 
remaining statistical calculations would have been much more complicated, 
and for a first approximation this was not deemea necessary. 

The categoric 1 grammar for noun phrases generates the same types 
as the phrase -structure grammar, so the percents of the corpus accounted 
for by the categorial grammar are the same those shown in Table 13 » 
However, the stopping parameter allows the categorial grammar to generate 
seven types which are not found in the corpus. The stopping parameter 
allows the rule to be applied six times; thus all types from (N) to 
(A+A+A+A+A+A+N) and from (T+N) to (T+A+*+A+A+ A+A+N) are possible, although 
the probabilities of the longer types are small. The grammar generates 
nine observed types and the seven unobserved types which are placed in 
one cell; four parameters are used so without further collapsing due to 
low predicted frequencies, the model allows five degrees of freedom. 

Tables 49 and 50 show the maximum- likelihood estimates and the 
chi-square analysis for this grammar; in the miximum- likelihood calcula- 
tions 6 was used for N unless otherwise stated. On Table 5^ the column 
labeled "residual’’ includes the expected frequencies for types generated 
but not observed as well as any remaining uncollapsed types and round-off 
errors. The contributions from unlisted types is substantial but never 
as large as the largest contribution from the observed types. 
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Table 50 shows that- for different sections of the corpus the source 
of large chi-square contributions varies.. The contribution from type (N) 
is usually, but not a] ways, the largest, and no other pattern is evident 0 
For the phrase- structure grammar, on the other hand, the contributions from 
types (ll) and (T+N) were consistently the greatest- 

The last four pages of Table 50 show the chi-squares for the corpus 
as a whole for different choices of ,? N" , the number chosen in connection 
with the stopping parameter- It is apparent that the choice has little 
effect on the total chi-square or on the individual contributions; for N 
equal to 5> 7 , and 8, the total chi-square values are ^03°9> 599«3> 

397*7, and 397 • 5 respectively. Very large values of N would of course 
affect the chi-squares because these values of N would give very small 
probabilities to phrases of one to four words and large probabilities tc 
the longer, unobserved types. 

Table 51 presents a comparison of total chi-squares for each section 
of the corpus for the phrase- structure and categorial grammars. 

Insert Table 51 abo ut here 

Because of the differing degrees of freedom, F- tests were used to 
compare the chi- square s£ in this situation the independence assumption 
is violated, so the results must be considered only as approximations. 
However, no significant differences were apparent. While the grammars 
differ in their ability to predict particular phrase types (with the 
phrase- structure being more regular), the overall fit provided by each 
grammar is approximately the same. The similarity of the fit is further 
demonstrated by Figure 1 which shows the observed frequencies plotted 
against their rank order and the frequencies predicted by the phrase- 
structure and categorial grammars plotted at the rani, of the corresponding 
observed frequencies. 

Insert- Figure 7 ab out here 

Table 51 shows that the rank order of the total comparative chi-squares 
("comparative chi-square" indicates that differences in degrees of freedom 
have been considered) from the Ginn primer to the entire corpus is the same 
for the phrase- structure and categorial grammars. In both the trend of 
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Fig, 7° Comparison of observed and predicted frequencies 
for phrase -structure and categorial grammars for 
noun phrases (entire corpus). 
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increasing chi-squares from the pre-primers to the primers and from the 
primers to the readers is apparent- For both the total comparative chi- 
square for the primers and readers combined is greater than the total 
comparative chi-square for either of the individual volume s, and the 
comparative chi-square for the entire corpus is the largest of alia It is 
interesting that- for both grammars the pre-primers showed an exception 
to the rule of chi-squares for combinations being greater than chi-square 
for individual volumes. For the phrase- structure grammar the comparative 
chi-square for the Scott -Foresman pre-primer is larger than that for the 
pre-primers combined, and for the categorial grammar the Ginn pre-primer 
has a larger comparative chi-square than the pre-primers combined* 

The Categorial Grammar f o r V er b Phra ses 

The categorial grammar for verb phrases is summarized in Table 52. 

The obligatory transformations are the same as those of the phrase -structure 
grammar. 

Insert Table 52 about here 

Again only one rewrite rule (this time the second rewrite rule) is 
squired for the grammar, so no rule choice parameter is. necessary. 
Parameters A^, A^, and A^ have been used to designate the premise choices* 
For the verb phrase grammar a geometric stopping parameter, S^, is 
acceptable because the frequency of the observed phrases decreases as the 
length increases; represents the probability of stopping and - 1-S^ 
represents the probability of continuing. A geometric stopping parameter 
implies an infinite number of theoretical verb phrase types, but the 
probability of the longer types approaches zero rapidly. The choice for 
(3 is always and the choice for a when the rule is applied more than 

once is always and always the first term so no parameters are required 
for these choices <> Thus, three free parameters, A^, A^, and S^, are 
required for this model; nine observed types and an infinite number of 
unobserved types which are combined into one cell are generated, so when 
no further collapsing is necessary, six degrees of freedom are available. 

The derivations and theoretical probabilities for the verb-phrase 
grammar are shown in Table 53° 

Insert Table 53 about here 
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TABLE 52 

The Categorial Grammar for Verb Phrases 



Rewrite rule; 

Premises: s, s/n, s/[n/n] 

Primitive category: m 

Categories: s 

s/n 

s/[n/n] 
m\ s 

m\[s/n] 

m\[s/[n/n]] 

m 

m\m 



(intransitive verb) 

(transitive verb) 

(copulative verb) 

(infinitive form of intransitive verb) 
(infinitive form of transitive verb) 
(infinitive form of copulative verb) 
(modal) 

(negation) 



Obligatory transfo mat ions; / m \ s \ ,' m \s+ing 

1. If "m" is a form of "to be", )m\[s/n] ( -» ] m\s[s/n]+ing 

( m\[s/[n/n]]) (m\[s/[n/n]]+ing 

2 a. If "m\s" is a form of "to be" or "to do" and "m" is a form of "to be" 
or "to do", m,nKm,m\s -» s,m\m. 

2 b. If w s" is "will", "can", "may", "must", "shall", or "could", m,m\m, 
m\s -* s,n\m. 

3 . If "m\[s/n]" is a form of "to be" and "m" is a form of "to be" or 
"to do", m,m\m,m\[s/n] -* s/n,m\m. 

4. If "m\[s/[n/n] ] " is a form of "to be" and "m" is a form of "to be" or 
"to do", m,m\m,m\[s/[n/n]] -* s/[n/n],m\m. 

Parameters : A^-A^, premise choice parameters 

S--S , stopping parameter 

1 c- 



1. 
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TABLE 53 

Derivations from the Categorial Grammar for Verb Phrases 



Theoretical 



Type 


Derivation 


Probability 


IV 


s 


V S ! 


M+IV 


s-4 m,nXs 


A 1 ' S 2 * S 1 


M+-+IV 


s-4 m,m\s-» m,iiKm,ii\s 


A 1 ' S 2 2 * S 1 


TV 


s/n 


A 2 ’ S 1 


M+TV 


s/n-> m,m\[ s/n] 


VV B ! 


M+-+TV 


s/n~> m,m\[ s/n]-» m,Am,nK[s/n] 


A 2 ' *2 • S 1 


cv 


s/[n/n] 


A 3 ’ S 1 


M+CV 


s/[n/n]-» m,n\[s/[n/n] ] 


A 3 ’ S 2 ’ S 1 


M+-+CV 


s/[n/n]-» m,m\[s/[n/n]] -^m^aKm^A [ s/[n/n] ] 


A ^ ' S 2 " S 1 
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The percent of verb phrases in each section of the corpus accounted 
for by this model are the same as those for the phrase- structure grammar 
and are given in Table l8. The maximum- likelihood estimates and the chi- 
square analysis are given in Tables 54 and 55 • 

Insert Tables 54 and 55 about here 

As was stated earlier, the categorial grammar generates an infinite 
number cf unobserved verb phrase types all of which have very small 
probabilities; the total probabilities of unobserved types in the pre- 
primers combined, primers combined, readers combined, and all combined 
are . 004 , . 011 , . 010 , and .009 respectively,. The chi-square contributions 
from these types as well as contributions from any remaining uncollapsed 
types and round-off errors are shown in the row labeled v, Residual” in 
Table 55* As in the noun- phrase grammar, these are substantial but far 
smaller than the largest contribution from an observed type . 

Table 55 shows that except for the Ginn pre-primer (where the fit 
is consistently good) the largest contibutors to the total chi-squares 
are the (M+IV) and (M+-+TV) types. This reflects the problem faced by 
the phrase- structure grammar • The observed proportions of the use of 
modals and of modals plus negatives differ for transitive and intransitive 
verbs, but the parameter assignments do not reflect this difference. 

A comparison of the phrase- structure and categorial grammars for 
verb phrases is presented in Table 56 . 

Insert Table 56 about here 

Again F-tests were used to compare corresponding total chi-square 
values although the results are only approximations, and again no 
significant differences were apparent. The two kinds of grammars seem 
to provide equivalent fits to the observed data in each section of the 
corpus and in the corpus as a whole. Figure 8, which corresponds to 
Figure 7 for noun phrases, shows the similarity of the two fits. 

Insert Figure 8 about here 
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TABLE 55 



Observed and Expected Frequencies, Chi-Square Contributions, 
and Total Chi-Squares for each Section of the Corpus for the 
Categorial Grammar for Verb Phrases 



GINN PRE-PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


462 


453.3 


.2 


IV 


34 


48.0 


4.1 


MflV 


1 1 


5.1 


6.9 


Mf-+IV 


336 


337.9 


.0 


TV 


3B 


35.8 


,\ 


PH-TV 


4 


3.8 




Mf -+TV 


7 


8.C 


• 1 


CV 


0 


.9 




PH-CV 


2 


.1 




PH— *-CV 


6 


5.8 


.0 


RESIDUAL 


*94 


894.0 


11.4 


TOTAL 



3 DEGREES OF FREEDOM 



SCOTT- FORESMAN PR E- PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


325 


289.6 


4.3 


IV 


30 


64.2 


18.2 


PM- IV 


17 


14.2 


.5 


PH--+IV 


187 


218.7 


4.6 


TV 


68 


48.5 


7.9 


PH-TV 


26 


10.7 


21.7 


PH--+TV 


l 


3.1 




CV 


3 


.7 




PH-CV 


0 


.2 




PH--+CV 


4 


11. 1 


4.5 


RESIDUAL 


657 


657.0 


61 .8 


TOTAL 



3 DEGREES OF FREEDOM 

! 

o 
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TABLE 55 (continued) 



PRE-PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


787 


739.3 


3.1 


IV 


64 


117.5 


24.5 


Mf IV 


2* 


18.7 


4.7 


Wf-flV 


523 


554.3 


1.8 


TV 


106 


88.1 


3.6 


Mf TV 


30 


14.0 


18.3 


Mf -+TV 


8 


10.9 


.8 


CV 


3 


1.7 




Mf CV 


2 


.3 




Mf ~fCV 


5 


8.2 


1.3 


RESIDUAL 


1551 


1551.0 


57.9 


TOTAL 






4 


DEGREES OF FREEDOM 





GINN 


PRIMER 




OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


579 


543.7 


2.3 


IV 


75 


115.9 


14.4 


Mf IV 


37 


24.7 


6.1 


Mf-+I V 


455 


493.3 


3.0 


TV 


131 


105.2 


6.3 


Mf TV 


41 


22.4 


15.4 


Mf-+TV 


64 


57.4 


.8 


CV 


3 


12.2 


7.0 


Mf CV 


6 


2.6 




Mf-+CV 


6 


16.1 


€.3 


RESIDUAL 


1391 


1391 .0 


61 .6 


TOTAL 



5 DEGREES OF FREEDOM 




TABLE 55 (continued) 



I 

I 

I 

I 

( 



SCOTT- FOR ESMAN PRIMER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


459 


408.4 


6.3 


IV 


53 


99.0 


21.3 


MflV 


27 


24.0 


.4 


Mf-+I V 


428 


456.9 


1.8 


TV 


97 


110.7 


1.7 


Mf TV 


78 


26.8 


97.6 


PH--+TV 


30 


30.3 


.0 


CV 


2 


7.3 


3.9 


Mf CV 


8 


1.8 




Mf -+CV 


8 


18.6 


6.0 


RESIDUAL 


11 82 


1182.0 


139.1 


TOTAL 



5 DEGREES OF FREEDOM 



( 

I. 

1 

L 

[ 

i 

i 

i 



PRIMERS COMBINED 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


1038 


951.0 


8.0 


IV 


128 


215.7 


35.7 


Mf I V 


64 


48.9 


4.6 


Mf-+I V 


883 


951.0 


4.9 


TV 


228 


215.7 


.7 


Mf TV 


119 


48.9 


100.3 


Mf -+TV 


94 


87.4 


.5 


CV 


5 


19.8 


11.1 


Mf C V 


14 


4.5 




Mf-+CV 


14 


34.5 


12.2 


RESIDUAL 


2573 


2573.0 


177.9 


TOTAL 



5 DEGREES OF FREEDOM 



I 
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GINN READER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


901 


831.4 


5.8 


IV 


121 


193.2 


27.0 


MflV 


61 


44.9 


5.8 


PH--+IV 


585 


662.5 


9.1 


TV 


199 


153.9 


13.2 


MfTV 


79 


35.8 


52.3 


Mf-+TV 


60 


50.7 


1.7 


CV 


3 


11.8 


6.5 


MfCV 


3 


2.7 




Mf-+CV 


3 


28.0 


22.3 


RESIDUAL 


2012 


2012.0 


1 43.6 


TOTAL 



5 DEGREES OF FREEDOM 



SCOTT- FORESMAN READER 



OBSERV. 


EXPECT. 


CHI**2 


SOURCE 


755 


714.5 


2.3 


IV 


85 


136.3 


19.3 


MflV 


43 


26.0 


11.1 


Mf-+I V 


716 


717.7 


.0 


TV 


109 


137.0 


5.7 


Mf TV 


62 


26.1 


49.2 


Mf-+TV 


42 


50.2 


1.3 


CV 


12 


9.6 


.6 


MfCV 


8 


1.8 




Mf-+CV 


8 


14.5 


2.9 


RESIDUAL 


1832 


1832.0 


92.5 


TOTAL 



5 DEGREES OF FREEDOM 
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READERS COMBINED 



09SERV. 


EXPECT. 


CHI**2 


SOURCE 


! 6^6 


1547.0 


7.7 


IV 


206 


529.7 


46.4 


Mf I V 


104 


70.5 


16.2 


Mf~fIV 


150! 


1577.1 


4.2 


TV 


50? 


295.5 


.7 


MfTV 


141 


62.5 


98.5 


Mf -+TV 


102 


100.7 


.0 


CV 


15 


21.5 


1.9 


Mf CV 


11 


4.6 




Mf-+CV 


1 1 


41 .8 


22.7 


RESIDUAL 


3844 


5844.0 


198.5 


TOTAL 



5 DEGREES OF FREEDOM 





ALL 


COMBINED 




09SERV. 


EXPECT. 


CHI**2 


SOURCE 


3481 


3228.6 


19.7 


IV 


598 


670.6 


110.8 


Mf I V 


196 


139.3 


23.1 


Mf-+IV 


2707 


2883.1 


10.8 


TV 


642 


598.9 


3.1 


Mf TV 


990 


124.4 


220.5 


Mf-H-TV 


204 


201.2 


.0 


CV 


25 


41.8 


8.5 


Mf CV 


27 


8.7 


38.6 


Mf -f CV 


0 


71.4 


71.4 


RESIDUAL 


7968 


7968.0 


506.5 


TOTAL 



6 DEGREES OF FREEDOM 
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Figo 8, Comparison of observed and predicted frequencies 
for phrase-structure and categorial grammars for 
verb phrases (entire corpus) « 




158 00162 

The two categorial grammars which have been presented in this 
chapter provide approximately the same degree of fit to the data as the 
corresponding phrase- structure grammars given in Chapter III. If categorial 
grammars can, as indicated by this work, be as representative of the 
corpus as phrase- structure grammars, future work might profit from the 
construction of mainly categorial grammars. The advantages of easier 
adaptability may outweigh the disadvantages of unfamiliar terminology. 
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CHAPTER V 

SUMMARY AND CONCLUSIONS 

This study has provided a syntactical analysis of two widely used 
first grade readers — the Scott-Foresman series and the Ginn series . The 
analysis was presented in terms of six phrase- structure grammars, each of 
which provides a probabilistic description of the phrases it represents. 

The six grammars- -a noun-phrase grammar, a verb-phrase grammar, a 
verbal-modifier grammar , a grammar for statements without verbs, an 
interrogative grammar , and a grammar for statements with verbs — were 
necessary because of the extremely large number of utterance types contained 
in the corpus. A single grammar containing all syntactic details would 
have had to account for many type frequencies which were too small for 
statistical analysis. For purposes of comparison categorial grammars were 
written for the noun phrases and the verb phrases. 

The study has demonstrated a quantitative method for evaluating 
linguistic grammars. The method provides a theoretical framework to account 
for the utterances of the corpus and their frequencies of occurrence, and 
provides a quantitative measure for judging the fit of a grammar to a 
corpus and for comparing the fit of two different grammars to the same 
corpus. Parameters attached to the choice points of a grammar can be 
used to form the theoretical probability of J w syntactical type derivable 
from the grammar. The parameters are estimsjMpby methods of maximum- 
likelihood, and the estimated probabilities provide theoretical frequencies 
for each syntactical type. Chi-square tests are then used to determine the 
goodness-of-fit of the grammar to the corpus. 
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Both the Scott-Foresman and the Ginn first-grade reading series 
are divided into a pre-primer * a primer, and a first reader. All of the 
grammars accounted for a high percent of their respective phrases or 
utterances in each section of the corpus. The grammar for noun phrases, 
for example, accounted for 98*6 percent of the pre- primers, 98.2 percent 
of the primers, 98.0 percent of the readers, and 98.2 percent of the 
entire corpus; the respective percents of the grammar for statements with 
verbs were 91*2, 87. 5 y 8l.8, and 85-5* Very few types not found in the 
corpus were generated by the phrase- structure grammars, and these had low 
probabilities; unobserved types were generated by the categorial grammars, 
and these too had very low probabilities. 

Whenever the observed frequencies were sufficiently large, the 
analysis was performed on each section of each series and on the 
corresponding combined sections, as well as on the corpus as a whole. In 
each case the same grammar was used, but the parameters were reestimated, 
new theoretical type frequencies were obtained, and corresponding chi-square 
values were computed. For the noun-phrase grammars (both phrase- structure 
and categorial) the chi-square values increased from the pre-primers to the 
primers and from the primers to the readers indicating that these grammars 
provide the best fits at the lower reading leveles. A second trend for 
the noun- phrase grammars was larger chi-square values for the combined 
sections (e.g., for the Scott-Foresman and Ginn .primers combined) 
than for either individual section. This reflects a difference in 
proportional noun- phrase frequencies in the two series. Neither of these 
trends was present in the verb-phrase grammars or in the grammar for 
statements with verbs; in these grammars there were no significant 
differences among sections of the corpus with the e; eption of an extremely 
poor fit of the statement grammar to the pre-primers combined. Individual 
section analyses were not possible for the other three grammars. 



many small contributions implying that overall the probabilistic framework 
was quite representative. However, in constructing the grammars a large 
number of parameters was used; the number of degrees of freedom was small 
relative to the size of the chi-squares, and some sort of fit would be 



In general the grammars provided 
total chi-squares were generally compos 
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expected. It is hoped that in further investigations grammars which 
require a smaller number of parameters will be written . 

The best fit for the phrase- structure grammars was provided by the 
verbal-modifier grammar, and the fit of the grammar for statements without 
verbs was almost equivalent. The fit of the interrogative grammar and 
that of the verb-phrase grammar to their respective parts of the corpus 
were roughly equivalent and better than the fit of the noun- phrase grammar. 
The verb-phrase grammar provided a better fit than the noun-phrasr 
grammar for all sections of the corpus, but the difference was significant 
only in the case of the Scott-Foresman reader and the readers combined. 

For the grammar for statements with verbs, the fits to the primers 
individually and combined and to the Ginn reader were not significantly 
different than those of the noun-phrase grammar for corresponding 
sections. The fit to the pre-primers combined was significantly worse; 
and the fits to the Scott-Foresman reader, the readers combined, and the 
corpus as a whole were significantly better than those given by the 
noun- phrase grammars ; the latter were equivalent to those of the verb-phrase 
grammars . 

The categorial grammars were approximately equivalent to che phrase- 
structure grammars in their ability to represent the corpus. For future 
work categorial grammars might be a better basis for analysis than phrase- 
structure grammars even though they involve less familiar terminology. 

Any work which involves the comparison of one corpus to another- -chi Id 
speech to child readers, for example- -requires a grammar which is easily 
adaptable. To adapt a categorial grammar to a new corpus, a few new 
categories may be necessary, but no new rewrite rules are needed; while 
new parameters for category choices might be needed, the assignment of the 
basic rule choice and stopping parameters would be unchanged. In contrast, 
to adapt a phrase- structure grammar to a new j^orpus many of the rewrite 
rules themselves would have to be changed; tills* would involve a 
reassignment of parameters which, in turn, feujd make final comparisons 

TV 

more difficult. 
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This dissertation has made a contribution to the problem of finding 
an appropriate match of reading materials to the reader « Previous work 
Bormuth (1964); Ruddell (1964, 1965) j Strickland (1963) has indicated a 
definite relationship between the ease of reading and comprehension to 
the similarity of written material and the spoken language of the 
reader. This study has demonstrated a method for a concise representation 
of syntactic patterns and their frequencies in a corpus and has suggested 
a quantitative analysis to indicate exactly how accurate the representation 
is. It has provided such a representation for two widely used first-grade 
readers, the Scott-Foresman series and the Ginn series. It is hoped that 
further investigation with special emphasis on reducing the number of 
parameters and including the most infrequent patterns will improve the 
analysis; this work should involve experimentation with other kinds of 
generative grammars as well as revisions of the grammars which have been 
used here. A later step is to similarly analyze the speech of first 
graders using these texts--some who are having reading difficulty and 
some who are not — to see what comparisons and what differences exist. 

The findings may indicate some changes in the reading material, particularly 
for those who are having trouble, which would increase the similarity 
of the sentence structure of the material to that of the children 1 s speech. 
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